Gene EcolC_3111 details

Gene Information

Locus tag: EcolC_3111
Symbol:
ID: 6066317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3407035
End bp: 3408396
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 51%
IMG OID: 641602527
Product: allantoinase
Protein accession: YP_001726061
Protein GI: 170021107
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
            [TIGR03178] allantoinase
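
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3407035
end_bp = 3408396
gene_length_bp = 1362
protein_length_aa = 453

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True for this record.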


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.395546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT 
GTAGATATCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA
AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCGG GCATGGTTGA TGCGCACACC
CATATTTCTG AACCGGGTCG TAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCG
GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT
GACCGTGCTT CAATTGAACT GAAGTTCGAT GCCGCTAAAG GCAAGCTGAC TATCGATGCG
GCACAACTCG GTGGCCTGGT GTCTTACAAC ATTGATCGTC TGCATGAGTT GGATGAAGTG
GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC
TTCCGTGATG TAAACGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGG
CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ATGCACTGGG CGAAGAAGCG
AAAAGTGAAG GTCGCGTAAC TGCCCATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA
GTGGAAGCGA TTCGCCGCGT ACTGTACCTG GCGAAAGTTG CCGGTTGCCG TCTGCACATT
TGCCATATCA GCAGCCCAGA AGGTGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGTCAG
GATGTTACTT GTGAATCCTG CCCGCATTAC TTTGTACTGG ATACCGATCA GTTCGAAGAA
ATCGGTACTC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG
TGGGAAAAAC TGTTTAACGG TGAAATCGAC TGCCTGGTTT CCGACCACTC ACCATGCCCT
CCGGAAATGA AAGCCGGCAA CATCATGGAA GCATGGGGCG GTATTGCCGG TCTGCAAAAC
TGTATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC
GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC
GCCCCAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC
AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC CATTGGCGCG
CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATTGAACA GGGCTTCCCT
GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
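
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the Gene Information table. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and only the first line of the sequence is used here as a short demonstration.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT"
fragment = clean_sequence(first_line)

print(len(fragment))                   # number of bases in the fragment
print(round(gc_percent(fragment), 1))  # GC percentage of the fragment only
```

To compare against the reported GC content, the whole gene sequence would be passed through `clean_sequence` in place of `first_line`.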
 
Protein sequence
MSFDLIIKNG TVILENEARV VDIAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT 
HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA
AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG
QPVLVHCENA LICDALGEEA KSEGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHI
CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM
WEKLFNGEID CLVSDHSPCP PEMKAGNIME AWGGIAGLQN CMDVMFDEAV QKRGMSLPMF
GKLMATNAAD IFGLQQKGRI APGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA
RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ