Gene Gura_3618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3618 
Symbol 
ID5166677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4241221 
End bp4242309 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content54% 
IMG OID640551103 
Productaminodeoxychorismate lyase 
Protein accessionYP_001232345 
Protein GI148265639 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000276729 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAT ATTTATATTT CTTACATAAC AAAAAGTTGA GCCTGCTTCT GGTTACCGGA 
TTATTACTAC TTGCGCCGGT TTTACGATTC GCATTTTTCC TCACCACATC TGCCGGCGAC
GGCAGGAACG TGCAGATGCT GGACATCGGC CACGGTTCAA GTCCGGGGAA AATGGCTGCT
GACCTCGAAA CGAAGAAAAT CATCTCCAGC GCCAGGCTGT TTACCCTCTA CACCCGATTC
AGCGGCGCCG ACGCCAGATT GAAAGCCGGA CTTTACCAGT TCAACGACGG CATGAAACCA
ACGGAAATCG TGCATAAGAT GGTGGCCGGA GATGTTTACC TCCGTCTCTT TGCCCTGCCC
GAAGGATACT CCACATACCA GGCGGCGGAA CTGCTCCAGT CCCGCAGGTT TTTCAGCAAG
GAATCGTTCC TCAAGCAGTG CGTAAACAGG AAACTACTCG CTGAACTCGG CATTCCGGGC
AAAAGCGTTG AAGGCTACCT CTATCCCGGC GCCTACAACA TCCCCCCGAA CATGGACGAA
GCTGAGCTGA TCCGGCAGAT GGTGCGGAAG TTCAACGAGG TGTATGCGGA CAAGTTCGAC
GACCGGGCAA AAAAACTGGC AATGAACCGC CATAAAGTTC TGACCCTGGC CTCGATGATT
GAGAAGGAGG CAGTCGACCC CTCCGAGCGC CCCATCATCT CTGCCGTCTT TTACAACCGG
CTGAAAAAGG GGATGCGGCT GCAGAGCGAC CCGACCGCTG TCTACGGTGT GCGTGCATTT
GCCGGCAAGG TGTCGAAGCA GGACATCATG CGTCACTCCG ATTACAACAC CTATCTGATA
AACGGTATCC CCCCGGGACC CATAGGCAAT CCGAGCAGCG CGGCCATCGA AGCGGTTCTC
AGCCCGGCCC AATGCGACTA CCTCTACTTC GTGGCGAAAA AGGACGGCAA TCACTTTTTT
TCCAAAAACC TGGAAGAACA TAACCAGGCA GTGAACCGAT ATCTGAAATC TTCCGCAGCC
GCTCCTCCAG CAACGCAACA CATCGCGGGG TACACGAATG ACCAGCCGAA TCTTACTGGC
AGAAGATAA
 
Protein sequence
MKRYLYFLHN KKLSLLLVTG LLLLAPVLRF AFFLTTSAGD GRNVQMLDIG HGSSPGKMAA 
DLETKKIISS ARLFTLYTRF SGADARLKAG LYQFNDGMKP TEIVHKMVAG DVYLRLFALP
EGYSTYQAAE LLQSRRFFSK ESFLKQCVNR KLLAELGIPG KSVEGYLYPG AYNIPPNMDE
AELIRQMVRK FNEVYADKFD DRAKKLAMNR HKVLTLASMI EKEAVDPSER PIISAVFYNR
LKKGMRLQSD PTAVYGVRAF AGKVSKQDIM RHSDYNTYLI NGIPPGPIGN PSSAAIEAVL
SPAQCDYLYF VAKKDGNHFF SKNLEEHNQA VNRYLKSSAA APPATQHIAG YTNDQPNLTG
RR