Gene EcDH1_4020 details


Gene Information

Locus tag: EcDH1_4020
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4348987
End bp: 4350831
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: TonB-dependent vitamin B12 receptor
Protein accession: ACX41620
Protein GI: 260451198
COG category:
COG ID:
TIGRFAM ID:
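
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4348987
end_bp = 4350831

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length)     # 1845
print(remainder)       # 0
print(protein_length)  # 614
```

Under these assumptions the result agrees with the listed gene length of 1845 bp and protein length of 614 aa.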


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00348064
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCCGTCA CGGCATTTTC CGCTTGGGCA 
CAGGATACCA GCCCGGATAC TCTCGTCGTT ACTGCTAACC GTTTTGAACA GCCGCGCAGC
ACTGTGCTTG CACCAACCAC CGTTGTGACC CGTCAGGATA TCGACCGCTG GCAGTCGACC
TCGGTCAATG ATGTGCTGCG CCGTCTTCCG GGCGTCGATA TCACCCAAAA CGGCGGTTCA
GGTCAGCTCT CATCTATTTT TATTCGCGGT ACAAATGCCA GTCATGTGTT GGTGTTAATT
GATGGCGTAC GCCTGAATCT GGCGGGGGTG AGTGGTTCTG CCGACCTTAG CCAGTTCCCT
ATTGCGCTTG TCCAGCGTGT TGAATATATC CGTGGGCCGC GCTCCGCTGT TTATGGTTCC
GATGCAATAG GCGGGGTGGT GAATATCATC ACGACGCGCG ATGAACCCGG AACGGAAATT
TCAGCAGGGT GGGGAAGCAA TAGTTATCAG AACTATGATG TCTCTACGCA GCAACAACTG
GGGGATAAGA CACGGGTAAC GCTGTTGGGC GATTATGCCC ATACTCATGG TTATGATGTT
GTTGCCTATG GTAATACCGG AACGCAAGCG CAGACAGATA ACGATGGTTT TTTAAGTAAA
ACGCTTTATG GCGCGCTGGA GCATAACTTT ACTGATGCCT GGAGCGGCTT TGTGCGCGGC
TATGGCTATG ATAACCGTAC CAATTATGAC GCGTATTATT CTCCCGGTTC ACCGTTGCTC
GATACCCGTA AACTCTATAG CCAAAGTTGG GACGCCGGGC TGCGCTATAA CGGCGAACTG
ATTAAATCAC AACTCATTAC CAGCTATAGC CATAGCAAAG ATTACAACTA CGATCCCCAT
TATGGTCGTT ATGATTCGTC GGCGACGCTC GATGAGATGA AGCAATACAC CGTCCAGTGG
GCAAACAATG TCATCGTTGG TCACGGTAGT ATTGGTGCGG GTGTCGACTG GCAGAAACAG
ACTACGACGC CGGGTACAGG TTATGTTGAG GATGGATATG ATCAACGTAA TACCGGCATC
TATCTGACCG GGCTGCAACA AGTCGGCGAT TTTACCTTTG AAGGCGCAGC ACGCAGTGAC
GATAACTCAC AGTTTGGTCG TCATGGAACC TGGCAAACCA GCGCCGGTTG GGAATTCATC
GAAGGTTATC GCTTCATTGC TTCCTACGGG ACATCTTATA AGGCACCAAA TCTGGGGCAA
CTGTATGGCT TCTACGGAAA TCCGAATCTG GACCCGGAGA AAAGCAAACA GTGGGAAGGC
GCGTTTGAAG GCTTAACCGC TGGGGTGAAC TGGCGTATTT CCGGATATCG TAACGATGTC
AGTGACTTGA TCGATTATGA TGATCACACC CTGAAATATT ACAACGAAGG GAAAGCGCGG
ATTAAGGGCG TCGAGGCGAC CGCCAATTTT GATACCGGAC CACTGACGCA TACTGTGAGT
TATGATTATG TCGATGCGCG CAATGCGATT ACCGACACGC CGTTGTTACG CCGTGCTAAA
CAGCAGGTGA AATACCAGCT CGACTGGCAG TTGTATGACT TCGACTGGGG TATTACTTAT
CAGTATTTAG GCACTCGCTA TGATAAGGAT TACTCATCTT ATCCTTATCA AACCGTTAAA
ATGGGCGGTG TGAGCTTGTG GGATCTTGCG GTTGCGTATC CGGTCACCTC TCACCTGACA
GTTCGTGGTA AAATAGCCAA CCTGTTCGAC AAAGATTATG AGACAGTCTA TGGCTACCAA
ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA
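
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before the listed length or GC content can be checked. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of the source database, and how the record rounds its GC value is not stated.

```python
def clean_sequence(text):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence in this record, pasted as displayed.
first_row = "ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCCGTCA CGGCATTTTC CGCTTGGGCA"

print(len(clean_sequence(first_row)))  # number of bases in the row
print(round(gc_percent(first_row), 1))  # GC percentage of this row only
```

To check the record's GC content field, pass the full gene sequence to `gc_percent` instead of a single row.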
 
Protein sequence
MIKKASLLTA CSVTAFSAWA QDTSPDTLVV TANRFEQPRS TVLAPTTVVT RQDIDRWQST 
SVNDVLRRLP GVDITQNGGS GQLSSIFIRG TNASHVLVLI DGVRLNLAGV SGSADLSQFP
IALVQRVEYI RGPRSAVYGS DAIGGVVNII TTRDEPGTEI SAGWGSNSYQ NYDVSTQQQL
GDKTRVTLLG DYAHTHGYDV VAYGNTGTQA QTDNDGFLSK TLYGALEHNF TDAWSGFVRG
YGYDNRTNYD AYYSPGSPLL DTRKLYSQSW DAGLRYNGEL IKSQLITSYS HSKDYNYDPH
YGRYDSSATL DEMKQYTVQW ANNVIVGHGS IGAGVDWQKQ TTTPGTGYVE DGYDQRNTGI
YLTGLQQVGD FTFEGAARSD DNSQFGRHGT WQTSAGWEFI EGYRFIASYG TSYKAPNLGQ
LYGFYGNPNL DPEKSKQWEG AFEGLTAGVN WRISGYRNDV SDLIDYDDHT LKYYNEGKAR
IKGVEATANF DTGPLTHTVS YDYVDARNAI TDTPLLRRAK QQVKYQLDWQ LYDFDWGITY
QYLGTRYDKD YSSYPYQTVK MGGVSLWDLA VAYPVTSHLT VRGKIANLFD KDYETVYGYQ
TAGREYTLSG SYTF
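
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon table by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The `translate` function is a hypothetical helper written for this illustration, and it does not handle ambiguous bases or alternative start codons.

```python
# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence in this record. Each earlier row
# holds 60 bases, so the last row also starts on a codon boundary.
first_row = "ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCCGTCA CGGCATTTTC CGCTTGGGCA"
last_row = "ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA"

print(translate(first_row))  # MIKKASLLTACSVTAFSAWA
print(translate(last_row))   # TAGREYTLSGSYTF*
```

Both outputs match the start and end of the listed protein sequence, with the trailing `*` corresponding to the TGA stop codon.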