Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2506 |
Symbol | |
ID | 4022997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2801469 |
End bp | 2802560 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962699 |
Product | hypothetical protein |
Protein accession | YP_569637 |
Protein GI | 91976978 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1509] Lysine 2,3-aminomutase |
TIGRFAM ID | [TIGR00238] KamA family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.524387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGG TCACCCGACT GCAAACGCCC ACGGCGACGA CGCTGCGCCA ACCCGCCGAA CTGATCGCGC AGGGCTTGGC GCCGGCCGAC GCACAGGACG ATCTCGAGCA GGTCGCGCAG CGCTACGCGA TCGCGGTGAC GCCGGACATC GCAGCGCTGA TCGATCCCGA CGACCCGGAC GATCCGATCG CGCGGCAGTA CATCCCGCGC GCCGAGGAGC TGGCGACGCT TCCGATCGAG CGCGACGATC CGATCGGCGA TGGCGCCCAT TCGCCAGTTG AAGGCATCGT GCATCGCCAT CGCGACCGCG TGCTGCTCAA GCTGGTGCAT GTCTGCGCGG TGTATTGCCG GTTCTGTTTC CGCCGCGAGA TGGTTGGGCC GGGCAAGGAC AATTCGCTGT CGGGCGACGC GACCGCAGCG GCGCTCGGCT ACATCCGCGC GCATCCGGAA ATCTGGGAAG TGATCCTGAC CGGCGGCGAT CCCTTGATGC TGTCGCCGCG CAGGCTCGCC GACATCATGG CCGAGCTGGC GACGATCGAT CATGTCAGGA TCATCCGCTT CCATACGCGA CTGCCTGTGG CCGAGCCGGC GCGGATCAGT GCGGAGCTGG TGCGGGCGTT GCGGGTCGAG GGCAAGACCG TTTGGATGGC GCTCCATGCC AATCATCCGC GCGAGCTCAC CACGGCTGCG CGGGCGGCGT GCGCGCGCAT CATCGATGCC GGCATTCCGA TGGTCAGCCA ATCGGTGCTG CTGGCCGGCG TCAATGACGA CGCAGCCACG CTGGAAGCGC TGATGCGCGT CTTCGTCGAA TGCCGGATCA AGCCGTATTA CCTGCATCAC GGCGACCTCG CGCCGGGCAC CGCGCATCTG CGCACCAGCC TCGCGGAAGG GCAGGCGCTG ATGCGGGCGC TGCGCGGCCG CGTCTCTGGT CTGTGTCAGC CCGAATACGT GCTCGACATT CCGGGCGGCT ACGGCAAAGC ACCGGTTGGC CCCAACTATT TGGCCGCGGA TGATGGAACG GCAGCGGATT CGCGCTATCG TGTCTCCGAC TATTGCGGCG ACGTCCATCT GTATCCGCCC AAGCCGGCTT GA
|
Protein sequence | MNKVTRLQTP TATTLRQPAE LIAQGLAPAD AQDDLEQVAQ RYAIAVTPDI AALIDPDDPD DPIARQYIPR AEELATLPIE RDDPIGDGAH SPVEGIVHRH RDRVLLKLVH VCAVYCRFCF RREMVGPGKD NSLSGDATAA ALGYIRAHPE IWEVILTGGD PLMLSPRRLA DIMAELATID HVRIIRFHTR LPVAEPARIS AELVRALRVE GKTVWMALHA NHPRELTTAA RAACARIIDA GIPMVSQSVL LAGVNDDAAT LEALMRVFVE CRIKPYYLHH GDLAPGTAHL RTSLAEGQAL MRALRGRVSG LCQPEYVLDI PGGYGKAPVG PNYLAADDGT AADSRYRVSD YCGDVHLYPP KPA
|
| |