Gene EcDH1_3101 details


Gene Information

Locus tag: EcDH1_3101
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3330447
End bp: 3331808
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: allantoinase
Protein accession: ACX40727
Protein GI: 260450305
COG category:
COG ID:
TIGRFAM ID:
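The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3330447, 3331808)
print(length, protein_length(length))  # 1362 453
```

Both results match the Gene Length and Protein Length fields of the record.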


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTG ATTTAATCAT TAAAAACGGC ACCGTTATTT TAGAAAACGA AGCTCGCGTT 
GTAGATATCG CCGTTAAAGG CGGAAAAATT GCTGCTATCG GTCAGGATCT GGGCGATGCA
AAAGAAGTTA TGGATGCGTC TGGTCTGGTG GTTTCGCCGG GCATGGTTGA TGCGCACACC
CATATTTCTG AACCGGGTCG TAGCCACTGG GAAGGTTATG AAACCGGTAC TCGCGCAGCG
GCAAAAGGTG GTATCACCAC CATGATCGAA ATGCCGCTCA ACCAGCTGCC TGCAACGGTT
GACCGCGCTT CAATTGAACT GAAGTTCGAT GCCGCTAAAG GCAAGCTGAC TATTGATGCG
GCACAACTCG GTGGCCTGGT GTCTTACAAC ATCGACCGTC TGCATGAGCT GGATGAAGTG
GGCGTTGTCG GCTTCAAATG CTTCGTTGCG ACCTGTGGCG ATCGCGGTAT CGACAACGAC
TTCCGTGATG TAAACGACTG GCAGTTCTTC AAAGGTGCGC AGAAGCTGGG CGAACTGGGT
CAGCCGGTGC TGGTGCACTG CGAAAACGCG CTGATTTGTG ACGAACTGGG CGAAGAAGCG
AAGCGTGAAG GTCGCGTAAC CGCTCATGAC TATGTGGCTT CGCGTCCGGT ATTTACCGAA
GTGGAAGCAA TTCGCCGCGT ACTGTATCTG GCGAAAGTTG CTGGTTGCCG TCTGCACGTT
TGCCACGTCA GCAGCCCGGA AGGTGTTGAG GAAGTGACTC GTGCACGTCA GGAAGGTCAG
GACGTTACTT GTGAATCCTG CCCGCATTAC TTTGTACTGG ATACCGATCA GTTCGAAGAA
ATCGGTACTC TGGCGAAGTG TTCACCGCCG ATCCGCGATC TGGAAAACCA GAAAGGCATG
TGGGAAAAAC TGTTTAACGG TGAAATCGAC TGCCTGGTTT CCGACCACTC TCCATGCCCG
CCGGAAATGA AAGCCGGTAA CATCATGAAA GCATGGGGCG GTATCGCCGG TCTGCAAAGC
TGCATGGACG TGATGTTCGA TGAAGCGGTA CAGAAACGCG GTATGTCTCT GCCAATGTTC
GGCAAATTAA TGGCGACTAA CGCAGCAGAT ATTTTCGGTC TGCAGCAAAA AGGCCGTATC
GCCCCAGGAA AAGATGCCGA CTTCGTCTTC ATTCAGCCGA ATAGCAGCTA TGTTCTTACC
AATGACGATC TGGAATATCG CCACAAAGTC AGCCCGTATG TTGGCCGTAC CATTGGCGCG
CGTATCACGA AAACCATCTT ACGTGGTGAT GTGATTTACG ACATTGAACA GGGCTTCCCT
GTTGCGCCGA AAGGTCAATT TATCCTTAAA CATCAGCAGT AA
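The GC content field can be recomputed from the gene sequence as the share of G and C bases. The following is a minimal sketch; the helper name `gc_percent` is hypothetical, and it assumes the sequence is given in the space-separated 10-base blocks used above.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence: 4 G/C bases out of 20.
print(gc_percent("ATGTCTTTTG ATTTAATCAT"))  # 20.0
```

Passing the full 1362 bp sequence to the same function gives the value to compare against the GC content field.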
 
Protein sequence
MSFDLIIKNG TVILENEARV VDIAVKGGKI AAIGQDLGDA KEVMDASGLV VSPGMVDAHT 
HISEPGRSHW EGYETGTRAA AKGGITTMIE MPLNQLPATV DRASIELKFD AAKGKLTIDA
AQLGGLVSYN IDRLHELDEV GVVGFKCFVA TCGDRGIDND FRDVNDWQFF KGAQKLGELG
QPVLVHCENA LICDELGEEA KREGRVTAHD YVASRPVFTE VEAIRRVLYL AKVAGCRLHV
CHVSSPEGVE EVTRARQEGQ DVTCESCPHY FVLDTDQFEE IGTLAKCSPP IRDLENQKGM
WEKLFNGEID CLVSDHSPCP PEMKAGNIMK AWGGIAGLQS CMDVMFDEAV QKRGMSLPMF
GKLMATNAAD IFGLQQKGRI APGKDADFVF IQPNSSYVLT NDDLEYRHKV SPYVGRTIGA
RITKTILRGD VIYDIEQGFP VAPKGQFILK HQQ
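The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and the sketch assumes the input is the coding strand in frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start codons
    (e.g. GTG, TTG) are not special-cased; this gene starts with ATG.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGTCTTTTG ATTTAATCAT TAAAAACGGC"))  # MSFDLIIKNG
print(translate("CATCAGCAGT AA"))                     # HQQ
```

The two outputs match the first ten and last three residues of the protein sequence listed above.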