Gene EcHS_A0585 details

Gene Information

Locus tag: EcHS_A0585
Symbol: allB
ID: 5592280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 598724
End bp: 600085
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 51%
IMG OID: 640919769
Product: allantoinase
Protein accession: YP_001457352
Protein GI: 157160034
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT 
GTAGATATCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA
AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCGG GCATGGTTGA TGCGCACACC
CATATTTCTG AACCGGGTCG TAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCG
GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT
GACCGTGCTT CAATTGAACT GAAGTTCGAT GCCGCTAAAG GCAAGCTGAC TATCGATGCG
GCACAACTCG GTGGCCTGGT GTCTTACAAC ATTGATCGTC TGCATGAGTT GGATGAAGTG
GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC
TTCCGTGATG TAAACGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGG
CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATCTGTG ATGCACTGGG CGAAGAAGCG
AAAAGTGAAG GTCGCGTAAC TGCCCATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA
GTGGAAGCGA TTCGCCGCGT ACTGTACCTG GCGAAAGTTG CCGGTTGCCG TCTGCACATT
TGCCATATCA GCAGCCCAGA AGGTGTTGAA GAAGTGACTC GTGCACGTCA GGAAGGTCAG
GATGTTACTT GTGAATCCTG CCCGCATTAC TTTGTACTGG ATACCGATCA GTTCGAAGAA
ATCGGTACTC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG
TGGGAAAAAC TGTTTAACGG TGAAATCGAC TGCCTGGTTT CCGACCACTC ACCATGCCCT
CCGGAAATGA AAGCCGGCAA CATCATGGAA GCATGGGGCG GTATTGCCGG TCTGCAAAAC
TGTATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GAATGTCTCT GCCAATGTTC
GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC
GCCCCAGGAA AAGATGCCGA GTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC
AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC CATTGGCGCG
CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATTGAACA GGGCTTCCCT
GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
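
The Gene Length and GC content fields of this record can be rechecked from the coordinates and the listed sequence. The following is a minimal Python sketch, assuming the sequence is copied as text with the spaces and line breaks shown here, and assuming the Start bp and End bp coordinates are 1-based and inclusive. The helper names `clean_sequence`, `gc_fraction`, and `span_length` are illustrative and not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
print(span_length(598724, 600085))  # 1362, matching the listed Gene Length

# First line of the gene sequence, used as a small demonstration input.
first_line = clean_sequence(
    "ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT"
)
print(len(first_line), round(gc_fraction(first_line), 2))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives the value to compare against the listed GC content.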
 
Protein sequence
MSFDLIIKNG TVILENEARV VDIAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT 
HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA
AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG
QPVLVHCENA LICDALGEEA KSEGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHI
CHISSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM
WEKLFNGEID CLVSDHSPCP PEMKAGNIME AWGGIAGLQN CMDVMFDEAV QKRGMSLPMF
GKLMATNAAD IFGLQQKGRI APGKDAEFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA
RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ
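
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below is a hedged illustration, not the database's own method: it builds the codon table from the NCBI base ordering `TCAG`, relying on the fact that translation table 11 assigns sense codons the same amino acids as the standard code. It does not model the alternative start codons of table 11, which is not an issue for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGTCTTTTG ATTTAATCAT TAAAAACGGC"))  # MSFDLIIKNG
print(translate("CATCAGCAGT AA"))  # HQQ
```

The two outputs match the first ten and last three residues of the listed protein sequence. The listed lengths are also consistent: 1362 bp is 454 codons, and dropping the stop codon leaves 453 aa.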