Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3618 |
Symbol | |
ID | 5166677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4241221 |
End bp | 4242309 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640551103 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001232345 |
Protein GI | 148265639 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000276729 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAT ATTTATATTT CTTACATAAC AAAAAGTTGA GCCTGCTTCT GGTTACCGGA TTATTACTAC TTGCGCCGGT TTTACGATTC GCATTTTTCC TCACCACATC TGCCGGCGAC GGCAGGAACG TGCAGATGCT GGACATCGGC CACGGTTCAA GTCCGGGGAA AATGGCTGCT GACCTCGAAA CGAAGAAAAT CATCTCCAGC GCCAGGCTGT TTACCCTCTA CACCCGATTC AGCGGCGCCG ACGCCAGATT GAAAGCCGGA CTTTACCAGT TCAACGACGG CATGAAACCA ACGGAAATCG TGCATAAGAT GGTGGCCGGA GATGTTTACC TCCGTCTCTT TGCCCTGCCC GAAGGATACT CCACATACCA GGCGGCGGAA CTGCTCCAGT CCCGCAGGTT TTTCAGCAAG GAATCGTTCC TCAAGCAGTG CGTAAACAGG AAACTACTCG CTGAACTCGG CATTCCGGGC AAAAGCGTTG AAGGCTACCT CTATCCCGGC GCCTACAACA TCCCCCCGAA CATGGACGAA GCTGAGCTGA TCCGGCAGAT GGTGCGGAAG TTCAACGAGG TGTATGCGGA CAAGTTCGAC GACCGGGCAA AAAAACTGGC AATGAACCGC CATAAAGTTC TGACCCTGGC CTCGATGATT GAGAAGGAGG CAGTCGACCC CTCCGAGCGC CCCATCATCT CTGCCGTCTT TTACAACCGG CTGAAAAAGG GGATGCGGCT GCAGAGCGAC CCGACCGCTG TCTACGGTGT GCGTGCATTT GCCGGCAAGG TGTCGAAGCA GGACATCATG CGTCACTCCG ATTACAACAC CTATCTGATA AACGGTATCC CCCCGGGACC CATAGGCAAT CCGAGCAGCG CGGCCATCGA AGCGGTTCTC AGCCCGGCCC AATGCGACTA CCTCTACTTC GTGGCGAAAA AGGACGGCAA TCACTTTTTT TCCAAAAACC TGGAAGAACA TAACCAGGCA GTGAACCGAT ATCTGAAATC TTCCGCAGCC GCTCCTCCAG CAACGCAACA CATCGCGGGG TACACGAATG ACCAGCCGAA TCTTACTGGC AGAAGATAA
|
Protein sequence | MKRYLYFLHN KKLSLLLVTG LLLLAPVLRF AFFLTTSAGD GRNVQMLDIG HGSSPGKMAA DLETKKIISS ARLFTLYTRF SGADARLKAG LYQFNDGMKP TEIVHKMVAG DVYLRLFALP EGYSTYQAAE LLQSRRFFSK ESFLKQCVNR KLLAELGIPG KSVEGYLYPG AYNIPPNMDE AELIRQMVRK FNEVYADKFD DRAKKLAMNR HKVLTLASMI EKEAVDPSER PIISAVFYNR LKKGMRLQSD PTAVYGVRAF AGKVSKQDIM RHSDYNTYLI NGIPPGPIGN PSSAAIEAVL SPAQCDYLYF VAKKDGNHFF SKNLEEHNQA VNRYLKSSAA APPATQHIAG YTNDQPNLTG RR
|
| |