Gene EcDH1_1657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1657 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1803397 
End bp1806516 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content58% 
IMG OID 
Productouter membrane autotransporter barrel domain protein 
Protein accessionACX39321 
Protein GI260448899 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.656845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAC ATCTGAATAC CTGCTACAGG CTGGTATGGA ATCACATGAC GGGCGCTTTC 
GTGGTTGCCT CCGAACTGGC CCGCGCACGG GGTAAACGTG GCGGTGTGGC GGTTGCACTG
TCTCTTGCCG CAGTCACGTC ACTCCCGGTG CTGGCTGCTG ACATCGTTGT GCACCCGGGA
GAAACCGTGA ACGGCGGAAC ACTGGCAAAT CATGACAACC AGATTGTCTT CGGTACGACC
AACGGAATGA CCATCAGTAC CGGGCTGGAG TATGGGCCGG ATAACGAGGC CAATACCGGC
GGGCAATGGG TACAGGATGG CGGAACAGCC AACAAAACGA CTGTCACCAG TGGTGGTCTT
CAGAGAGTGA ACCCCGGTGG AAGTGTCTCA GACACGGTTA TCAGTGCCGG AGGCGGACAG
AGCCTTCAGG GACGGGCTGT GAACACCACG CTGAATGGTG GCGAACAGTG GATGCATGAG
GGGGCGATAG CCACAGGAAC CGTCATTAAT GATAAGGGCT GGCAGGTCGT CAAGCCCGGT
ACAGTGGCAA CGGATACCGT TGTTAATACC GGGGCGGAAG GGGGACCGGA TGCAGAAAAC
GGTGATACCG GGCAGTTTGT TCGCGGGGAT GCCGTACGCA CAACCATCAA TAAAAACGGT
CGCCAGATTG TGAGAGCTGA AGGAACGGCA AATACCACTG TGGTTTATGC CGGCGGCGAC
CAGACTGTAC ATGGTCACGC ACTGGATACC ACGCTGAATG GGGGATACCA GTATGTGCAC
AACGGCGGTA CAGCGTCTGA CACTGTTGTG AACAGTGACG GCTGGCAGAT TGTCAAAAAC
GGGGGTGTGG CCGGGAATAC CACCGTTAAT CAGAAGGGCA GACTGCAGGT GGACGCCGGT
GGTACAGCCA CGAATGTCAC CCTGAAGCAG GGCGGCGCAC TGGTTACCAG TACGGCTGCA
ACCGTTACCG GCATAAACCG CCTGGGAGCA TTCTCTGTTG TGGAGGGTAA AGCTGATAAT
GTCGTACTGG AAAATGGCGG ACGCCTGGAT GTGCTGACCG GACACACAGC CACTAATACC
CGCGTGGATG ATGGCGGAAC GCTGGATGTC CGCAACGGTG GCACCGCCAC CACCGTATCC
ATGGGAAATG GCGGTGTACT GCTGGCCGAT TCCGGTGCCG CTGTCAGTGG TACCCGGAGC
GACGGAAAGG CATTCAGTAT CGGAGGCGGT CAGGCGGATG CCCTGATGCT GGAAAAAGGC
AGTTCATTCA CGCTGAACGC CGGTGATACG GCCACGGATA CCACGGTAAA TGGCGGACTG
TTCACCGCCA GGGGCGGCAC ACTGGCGGGC ACCACCACGC TGAATAACGG CGCCATACTT
ACCCTTTCCG GGAAGACGGT GAACAACGAT ACCCTGACCA TCCGTGAAGG CGATGCACTC
CTGCAGGGAG GCTCTCTCAC CGGTAACGGC AGCGTGGAAA AATCAGGAAG TGGCACACTC
ACTGTCAGCA ACACCACACT CACCCAGAAA GCCGTCAACC TGAATGAAGG CACGCTGACG
CTGAACGACA GTACCGTCAC CACGGATGTC ATTGCTCAGC GCGGTACAGC CCTGAAGCTG
ACCGGCAGCA CTGTGCTGAA CGGTGCCATT GACCCCACGA ATGTCACTCT CGCCTCCGGT
GCCACCTGGA ATATCCCCGA TAACGCCACG GTGCAGTCGG TGGTGGATGA CCTCAGCCAT
GCCGGACAGA TTCATTTCAC CTCCACCCGC ACAGGGAAGT TCGTACCGGC AACCCTGAAA
GTGAAAAACC TGAACGGACA GAATGGCACC ATCAGCCTGC GTGTACGCCC GGATATGGCA
CAGAACAATG CTGACAGACT GGTCATTGAC GGCGGCAGGG CAACCGGAAA AACCATCCTG
AACCTGGTGA ACGCCGGCAA CAGTGCGTCG GGGCTGGCGA CCAGCGGTAA GGGTATTCAG
GTGGTGGAAG CCATTAACGG TGCCACCACG GAGGAAGGGG CCTTTGTCCA GGGGAACAGG
CTGCAGGCCG GTGCCTTTAA CTACTCCCTC AACCGGGACA GTGATGAGAG CTGGTATCTG
CGCAGTGAAA ATGCTTATCG TGCAGAAGTC CCCCTGTATG CCTCCATGCT GACACAGGCA
ATGGACTATG ACCGGATTGT GGCAGGCTCC CGCAGCCATC AGACCGGTGT AAATGGTGAA
AACAACAGCG TCCGTCTCAG CATTCAGGGC GGTCATCTCG GTCACGATAA CAATGGCGGT
ATTGCCCGTG GGGCCACGCC GGAAAGCAGC GGCAGCTATG GATTCGTCCG TCTGGAGGGT
GACCTGATGA GAACAGAGGT TGCCGGTATG TCTGTGACCG CGGGGGTATA TGGTGCTGCT
GGCCATTCTT CCGTTGATGT TAAGGATGAT GACGGCTCCC GTGCCGGCAC GGTCCGGGAT
GATGCCGGCA GCCTGGGCGG ATACCTGAAT CTGGTACACA CGTCCTCCGG CCTGTGGGCT
GACATTGTGG CACAGGGAAC CCGCCACAGC ATGAAAGCGT CATCGGACAA TAACGACTTC
CGCGCCCGGG GCTGGGGCTG GCTGGGCTCA CTGGAAACCG GTCTGCCCTT CAGTATCACT
GACAACCTGA TGCTGGAGCC ACAACTGCAG TATACCTGGC AGGGACTTTC CCTGGATGAC
GGTAAGGACA ACGCCGGTTA TGTGAAGTTC GGGCATGGCA GTGCACAACA TGTGCGTGCC
GGTTTCCGTC TGGGCAGCCA CAACGATATG ACCTTTGGCG AAGGCACCTC ATCCCGTGCC
CCCCTGCGTG ACAGTGCAAA ACACAGTGTG AGTGAATTAC CGGTGAACTG GTGGGTACAG
CCTTCTGTTA TCCGCACCTT CAGCTCCCGG GGAGATATGC GTGTGGGGAC TTCCACTGCA
GGCAGCGGGA TGACGTTCTC TCCCTCACAG AATGGCACAT CACTGGACCT GCAGGCCGGA
CTGGAAGCCC GTGTCCGGGA AAATATCACC CTGGGCGTTC AGGCCGGTTA TGCCCACAGC
GTCAGCGGCA GCAGCGCTGA AGGGTATAAC GGTCAGGCCA CACTGAATGT GACCTTCTGA
 
Protein sequence
MKRHLNTCYR LVWNHMTGAF VVASELARAR GKRGGVAVAL SLAAVTSLPV LAADIVVHPG 
ETVNGGTLAN HDNQIVFGTT NGMTISTGLE YGPDNEANTG GQWVQDGGTA NKTTVTSGGL
QRVNPGGSVS DTVISAGGGQ SLQGRAVNTT LNGGEQWMHE GAIATGTVIN DKGWQVVKPG
TVATDTVVNT GAEGGPDAEN GDTGQFVRGD AVRTTINKNG RQIVRAEGTA NTTVVYAGGD
QTVHGHALDT TLNGGYQYVH NGGTASDTVV NSDGWQIVKN GGVAGNTTVN QKGRLQVDAG
GTATNVTLKQ GGALVTSTAA TVTGINRLGA FSVVEGKADN VVLENGGRLD VLTGHTATNT
RVDDGGTLDV RNGGTATTVS MGNGGVLLAD SGAAVSGTRS DGKAFSIGGG QADALMLEKG
SSFTLNAGDT ATDTTVNGGL FTARGGTLAG TTTLNNGAIL TLSGKTVNND TLTIREGDAL
LQGGSLTGNG SVEKSGSGTL TVSNTTLTQK AVNLNEGTLT LNDSTVTTDV IAQRGTALKL
TGSTVLNGAI DPTNVTLASG ATWNIPDNAT VQSVVDDLSH AGQIHFTSTR TGKFVPATLK
VKNLNGQNGT ISLRVRPDMA QNNADRLVID GGRATGKTIL NLVNAGNSAS GLATSGKGIQ
VVEAINGATT EEGAFVQGNR LQAGAFNYSL NRDSDESWYL RSENAYRAEV PLYASMLTQA
MDYDRIVAGS RSHQTGVNGE NNSVRLSIQG GHLGHDNNGG IARGATPESS GSYGFVRLEG
DLMRTEVAGM SVTAGVYGAA GHSSVDVKDD DGSRAGTVRD DAGSLGGYLN LVHTSSGLWA
DIVAQGTRHS MKASSDNNDF RARGWGWLGS LETGLPFSIT DNLMLEPQLQ YTWQGLSLDD
GKDNAGYVKF GHGSAQHVRA GFRLGSHNDM TFGEGTSSRA PLRDSAKHSV SELPVNWWVQ
PSVIRTFSSR GDMRVGTSTA GSGMTFSPSQ NGTSLDLQAG LEARVRENIT LGVQAGYAHS
VSGSSAEGYN GQATLNVTF