Gene EcDH1_2240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2240 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2404465 
End bp2407788 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content50% 
IMG OID 
ProductAutotransporter beta- domain protein 
Protein accessionACX39888 
Protein GI260449466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.400804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGAAC AACATAGCGT ATTTAACAAG TATTCAACGG GCACATCGAA TTCATTTATT 
TTTAATAACG ATGTCAGTAG CATAACAGGG TTAGTCGCTC AATCGAATAG CACAATTATC
AATACTGACA GCGGCATCAT TGATTTGTAT GGTCGTGGTA GTGTCGGCAT GCTTGCTATA
GCAGATTCAA CAGCAGAAAA TCAGGGTAAA ATTACACTGG ATTCTATGTG GGTAGATGCA
AATGACACTA CCGCAATGCG AGATATAGCT AGCAACAGCG CCATTGACTT CGGTACAGGT
GTGGGAGTTG GTACTGATAG TTATAGTGGT GCAGGGAAAA ATGCAACAGC AATTAACCAA
TTGGGCGGTG TTATAACTAT TTATAACGCC GGCGCAGGTA TGGCGGCCTA TGGCGCCAGC
AATACAGTTA TTAACCAGGG GACGATTAAC CTCGAAAAAA ATGGTAATTA TGACGATAGT
CTGGCAGCAA ATACTCTGGT AGGGATGGCT GTTTATGAGC ATGGTACTGC TATCAACGAC
CAGACGGGTG TTATCAATAT CAATGTTGGT ACTGGTCAGG CGTTTTATAA CGATGGCACA
GGAACAATTG TTAACTATGG TACAATCTGC ACTTTCGGCG TGTGCCAATC GGGGAATGAG
TACAATAATA CAGATGATTT CACCTCACTG ATCTATACCG GTGGCGATAC GATTACACGA
AGCGGAGAAA CTGTAACGCT AAATAAATCT GCTGCTGTGA CTGATAAGCT GGCTGGGAAT
GTTGTTAATA GCGGAACGCT TTCCGGTGAT CAAATTACGG TATCAAGCGG TCTTCTGGAA
AATACCAGCG GTGGCATCAT CAATAACTTA GTAAAACTTG ACAAGGGTGC CGTCATTAAA
AATGCCGGGG TGATGACGAA TAACGTCGAT GTTAGCGGTG GAATCCTCAA TAATGCCGGA
GAAATGACTG CGCAAATTAC CATGAATGCT GGTGCTGATA GTTCGTTAGT GAACAACACC
GGAACCATCA ATAAAATCGT GCAGAACGCG GGGGTATTCA ATAATAGTGG CAGTGTAACA
GGGCGGATGA TGTCGGCTGG CGGGGTCTTT AATAATCAAA CTGACGGGGC GATTATGAGA
GGTGCTGCGC TGACAGGTAC TGCAGTGGCA AATAACGAAG GAACCTGGAA CCTCGGAAGT
AGTAGTGAGG GTAACAACAC CGGGATGCTG GAAGTTAATA ATAATTCAGC TTTCAATAAC
CGCGGCGAGT TTATTCTTGA TAACGACAAG AATGCTGTGC ACATCAACCA GTCCGGTACG
CTTTATAATA CCGGTCACAT GAACATCAGT AATTCTTCCC ACAACGGAGC CGTTAATATG
TGGGGCGGAA ATGGTCGTTT TATCAATGAC GGAACGATTG ATGTTTCTGC GAAGTCACTG
GTAGTCAGCG CTAATAATGC CGGCGATCAG AATGCCTTCT TCTGGAACCA GGATAACGGG
GTCATCAACT TCGATCACGA CAGCGCCAGT GCCGTGAAAG TCACCCACAG CAACTTTATT
GCCCAGAATG ACGGCATCAT GAACATCAGC GGCACCGGTG CTGTGGCTAT GGAAGGTGAT
AAGAACGCGC AGCTGGTTAA CAATGGCACC ATCAACCTCG GTACCGCAGG CACTACTGAC
ACGGGTATGA TCGGTATGCA ACTCGATGCC AACGCCACGG CGGATGCGGT AATCGAAAAC
AACGGCACCA TCAATATCTT CGCCAATGAC TCGTTTGCAT TTAGCGTACT GGGTACAGTA
GGTCATGTGG TTAACAACGG CACGGTGGTG ATTGCCGATG GGGTTACGGG TTCTGGACTG
ATCAAGCAGG GCGACAGCAT CAATGTTGAA GGTATGAACG GTAACAACGG TAATAGCAGC
GAAGTGCATT ATGGCGACTA TACGTTGCCG GATGTGCCGA AGCCCAATAC GGTTAGTGTA
ACGTCGGGAA GTGATGAGGC TGGTGGCAGC ATGAACAACC TCAACGGCTA TGTCGTCGGT
ACCAACGTTA ACGGCAGCGC CGGGAAGCTG AAGGTTAACA ATGCCAGCAT GAACGGCGTG
GAGATTAACA CGGGCTTTAC CGCTGGTACG GCAGACACCA CTGTGAGTTT TGATAACGTA
GTGGAAGGTA GCAACCTGAC CGACGCTGAC GCCATCACCT CAACGTCCGT GGTATGGACT
GCCAAAGGCA GCACCGATGC CAGCGGTAAC GTTGACGTCA CCATGAGCAA AAATGCCTAC
ACCGATGTGG CAACAGATGC CTCGGTGAAT GACATCGCGA AAGCACTGGA TGCGGGTTAC
ACCAACAACG AACTGTTTAC CAGCCTGAAC GTCGGCACGA CTGCTGAACT GAACAGTGCT
CTGAAACAGG TCAGCGGTAG CCAGGCGACC ACGGTATTCC GCGAAGCGCG CGTGTTAAGC
AACCGCTTTA GTATGCTGGC AGATGCCGCG CCGAAAGTGG GTAACGGTCT GGCGTTCAAC
GTTGTCGCGA AAGGCGATCC GCGTGCCGAG TTAGGTAATA ATACCGAATA CGACATGCTG
GCATTGCGTA AAACTATCGA CCTGAGCGAA AGCCAGACGA TGAGTCTGGA GTACGGTATC
GCTCGTCTCG ATGGTGATGG TGCGCAGAAA GCGGGTGATA ATGGCGTTAC AGGCGGTTAT
AGCCAGTTTT TTGGCCTGAA ACATCAGATG TCGTTCGATA ACGGCATGAA CTGGAATAAC
GCCTTGCGTT ACGACGTTCA CAACCTTGAC AGCAGCCGCT CGATTGCATT TGGCAACACG
AACAAAACGG CTGATACCGA CGTGAAACAG CAGTACCTGG AGTTCCGCAG CGAAGGGGCG
AAGACTTTCG AACCGAGCGA AGGACTGAAG GTTACGCCAT ATGCGGGTGT AAAACTGCGT
CACACACTGG AAGGTGGCTA TCAGGAGCGC AATGCCGGAG ACTTTAACCT GAATATGAAC
AGTGGCAGCG AAACGGCGGT GGACAGCATC GTCGGGCTGA AACTGGACTA CGCAGGTAAA
GACGGCTGGA GCGCTAGCGC TACGCTGGAA GGCGGGCCGA ACCTGAGCTA CGCGAAGAGC
CAGCGTACGG CAAGCCTGGC AGGCGCAGGC AGTCAGCACT TTAACGTCGA TGACGGTCAG
AAGGGCGGCG GCATCAATAG CCTGACAAGC GTCGGCGTGA AGTACAGCAG CAAAGAAAGT
TCGCTGAATC TGGATGCGTA CAACTGGAAA GAGGATGGCA TCAGCGATAA AGGCGTGATG
CTGAACTTCA AGAAAACGTT CTAA
 
Protein sequence
MTEQHSVFNK YSTGTSNSFI FNNDVSSITG LVAQSNSTII NTDSGIIDLY GRGSVGMLAI 
ADSTAENQGK ITLDSMWVDA NDTTAMRDIA SNSAIDFGTG VGVGTDSYSG AGKNATAINQ
LGGVITIYNA GAGMAAYGAS NTVINQGTIN LEKNGNYDDS LAANTLVGMA VYEHGTAIND
QTGVININVG TGQAFYNDGT GTIVNYGTIC TFGVCQSGNE YNNTDDFTSL IYTGGDTITR
SGETVTLNKS AAVTDKLAGN VVNSGTLSGD QITVSSGLLE NTSGGIINNL VKLDKGAVIK
NAGVMTNNVD VSGGILNNAG EMTAQITMNA GADSSLVNNT GTINKIVQNA GVFNNSGSVT
GRMMSAGGVF NNQTDGAIMR GAALTGTAVA NNEGTWNLGS SSEGNNTGML EVNNNSAFNN
RGEFILDNDK NAVHINQSGT LYNTGHMNIS NSSHNGAVNM WGGNGRFIND GTIDVSAKSL
VVSANNAGDQ NAFFWNQDNG VINFDHDSAS AVKVTHSNFI AQNDGIMNIS GTGAVAMEGD
KNAQLVNNGT INLGTAGTTD TGMIGMQLDA NATADAVIEN NGTINIFAND SFAFSVLGTV
GHVVNNGTVV IADGVTGSGL IKQGDSINVE GMNGNNGNSS EVHYGDYTLP DVPKPNTVSV
TSGSDEAGGS MNNLNGYVVG TNVNGSAGKL KVNNASMNGV EINTGFTAGT ADTTVSFDNV
VEGSNLTDAD AITSTSVVWT AKGSTDASGN VDVTMSKNAY TDVATDASVN DIAKALDAGY
TNNELFTSLN VGTTAELNSA LKQVSGSQAT TVFREARVLS NRFSMLADAA PKVGNGLAFN
VVAKGDPRAE LGNNTEYDML ALRKTIDLSE SQTMSLEYGI ARLDGDGAQK AGDNGVTGGY
SQFFGLKHQM SFDNGMNWNN ALRYDVHNLD SSRSIAFGNT NKTADTDVKQ QYLEFRSEGA
KTFEPSEGLK VTPYAGVKLR HTLEGGYQER NAGDFNLNMN SGSETAVDSI VGLKLDYAGK
DGWSASATLE GGPNLSYAKS QRTASLAGAG SQHFNVDDGQ KGGGINSLTS VGVKYSSKES
SLNLDAYNWK EDGISDKGVM LNFKKTF