Gene Information |
Locus tag | Rru_A1032 |
Symbol | |
ID | 3833493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 1221288 |
End bp | 1222679 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637825121 |
Product | ethanolamine ammonia-lyase heavy chain |
Protein accession | YP_426120 |
Protein GI | 83592368 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4303] Ethanolamine ammonia-lyase, large subunit |
TIGRFAM ID | |
|
|
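The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 1221288
END_BP = 1222679


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1392
print(expected_protein_length(length_bp))  # 463
```

Under these assumptions the results agree with the reported Gene Length (1392 bp) and Protein Length (463 aa).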
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGT ATCGGGCGAC GGTGGGGGGA ACACGCTACG ACTTCGCCGA TTTGCGAACG GTGATGGCCT GCGCCTCGCC GCGCCGCTCG GGCGACGAAC TGGCCGGACT GGCCGCCGAG AGCGACGCCC AGCGCATGGC GGCCCGCCTC GTTCTCGCCG ATCTGCCCCT GCGCGCCTTC CTTGACACGC CGCTTGTTCC CTATGAGAGC GACGAAGTCA CCCGGCTGAT CATCGATACC CACGATGGCG CCGCCTTCGC CCCGGTCGCC AGCCTGACCG TTGGCGGCTT TCGCGACTGG CTGCTGTCCT ATGCCGCCGA TAGCGCCGCC CTTGCCGCCC TCGCCCCCGG ACTGACCCCG GAAATGGTCG CCGCCGTCAG CAAGCTGATG CGCAACGCCG ATCTGATCGC CGTGGCGGCG AAATGCCAGG TGATCACCGG CTTTCGCACG ACGCTGGGCT TGCCCGGCCG TCTGGCCAGC CGCCTTCAGC CCAACCACCC CACCGATGAT CCCGCCGGTA TCGCCGCCTC GACCCTTGAC GGTCTGCTGT TTGGCATGGG CGACGCGGTG ATCGGCATCA ATCCGGCGAC CGATAACGTC GGCGCCTGCG TCACCTTGCT TGAGATGCTG GACGCCGTGC GCCAGCGCTT CGACATCCCC AGCCAATCTT GCGTGCTGAC CCATGTCACC AACAGCATCG AGGCGATCAA CCGCGGCGCC CCCCTCGACC TTGTCTTCCA GTCGGTCGCC GGCACCGAGG CGGCCAATGC CGGCTTTGGC ATCAGCTTGT CGCTGCTGGG CGAAGCCCGC GAGGCGGCCT TGTCGCTGCG GCGCGGCACG GTGGGCGACA ACGTCATGTA TTTCGAGACC GGCCAGGGCG CGGCGCTGTC GGCCGACGCC CACCACGGCG TCGATCAGCA AACCGTCGAG GTCCGCGCCT ATGCGGTCTG CCGGGCCTTC AAGCCGCTGA TCGTCAATAC GGTCGTCGGC TTTATCGGCC CGGAATATCT GTATGATTCC AAGCAGATCA TCCGCGCCGG CCTGGAAGAT CACTGCTGCG GCAAGCTGCT TGGCCTGCCG ATGGGGGTGG ATGTCTGCTA CACCAACCAC GCCGAGGCCG ATCAGGACGA TATGGATACC CTGATGACCC TGCTTGGCGT CGCCGGGGTC ACCTTCCTGA TCGGCGTGCC CGGCGCCGAT GACGTGATGC TCAATTATCA GAGCCTGTCC TATCACGACA TCCTTGGCCT GCGTCATCTG CTTGACCGCC GCCCCGCCCC CGAATTCGCC GACTGGCTCG CGCGCATGGG CATGAGCGAC GCCGGCGGAC GCCTGCCGCC GCTCGATGCC AGCGCCCCGG CCCTGCGTCG CCTTCTGGCT TCGGGAGGCT GA
|
Protein sequence | MGLYRATVGG TRYDFADLRT VMACASPRRS GDELAGLAAE SDAQRMAARL VLADLPLRAF LDTPLVPYES DEVTRLIIDT HDGAAFAPVA SLTVGGFRDW LLSYAADSAA LAALAPGLTP EMVAAVSKLM RNADLIAVAA KCQVITGFRT TLGLPGRLAS RLQPNHPTDD PAGIAASTLD GLLFGMGDAV IGINPATDNV GACVTLLEML DAVRQRFDIP SQSCVLTHVT NSIEAINRGA PLDLVFQSVA GTEAANAGFG ISLSLLGEAR EAALSLRRGT VGDNVMYFET GQGAALSADA HHGVDQQTVE VRAYAVCRAF KPLIVNTVVG FIGPEYLYDS KQIIRAGLED HCCGKLLGLP MGVDVCYTNH AEADQDDMDT LMTLLGVAGV TFLIGVPGAD DVMLNYQSLS YHDILGLRHL LDRRPAPEFA DWLARMGMSD AGGRLPPLDA SAPALRRLLA SGG
|
| |
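The gene sequence above is listed in coding orientation (it starts with ATG and ends with TGA), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, not the database's own method. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the amino acid assignments of translation table 11, which for internal codons match the standard code. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G ordering.
# Translation table 11 shares these assignments with the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop).

    Trailing bases that do not form a full codon are ignored. Alternative
    start codons are not special-cased; this record starts with ATG.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First two blocks and the last blocks of the gene sequence above.
print(translate("ATGGGATTGT ATCGGGCGAC"))  # MGLYRA
print(translate("TCGGGAGGCT GA"))          # SGG*
```

The two fragments reproduce the start (MGLYRA...) and end (...SGG) of the listed protein sequence. Passing the full gene sequence to `translate` and stripping the trailing `*` would allow a comparison against the whole protein.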