Gene EcSMS35_0554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0554 
SymbolallB 
ID6144951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp562337 
End bp563698 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content52% 
IMG OID641615448 
Productallantoinase 
Protein accessionYP_001742655 
Protein GI170682646 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR03178] allantoinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTA 
GTGGATGTCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA
AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCAG GCATGGTTGA TGCGCACACC
CATATTTCTG AACCGGGTCG CAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCA
GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT
GACCGCGCAT CGATTGAACT GAAGTTTGAT GCCGCCAAAG GCAAGCTGAC TATCGATGCG
GCGCAACTCG GTGGCCTGGT GTCTTACAAC ATCGACCGTC TGCATGAGCT GGATGAAGTG
GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC
TTCCGTGACG TCAATGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGA
CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ACGAACTTGG CGAAGAAGCG
AAACGTGAAG GTCGCGTAAC CGCACATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA
GTGGAAGCGA TTCGCCGCGT GCTGTACCTG GCGAAAGTTG CCGGTTGCCG TCTGCACGTT
TGCCATATCA GCAGCCCGGA AGGTGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGCCAG
GATGTTACCT GTGAATCCTG CCCGCATTAC TTTGTGCTGG ATACCGATCA GTTCGAAGAA
ATTGGCACCC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG
TGGGAAAAAC TGTTTAACGG TGAAATAGAC TGCCTGGTTT CCGACCACTC TCCATGCCCG
CCGGAAATGA AAGCCGGTAA CATCATGAAA GCGTGGGGCG GTATCGCTGG TCTGCAAAGC
TGCATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC
GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC
GCCCCAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC
AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC TATTGGCGCG
CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATCGAACA GGGCTTCCCT
GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
 
Protein sequence
MSFDLIIKNG TVILENEARV VDVAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT 
HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA
AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG
QPVLVHCENA LICDELGEEA KREGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHV
CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM
WEKLFNGEID CLVSDHSPCP PEMKAGNIMK AWGGIAGLQS CMDVMFDEAV QKRGMSLPMF
GKLMATNAAD IFGLQQKGRI APGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA
RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ