Gene EcDH1_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3643 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3921750 
End bp3923900 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content56% 
IMG OID 
Productcarbon starvation protein CstA 
Protein accessionACX41256 
Protein GI260450834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACTA AAAAGATATT CAAGCACATA CCCTGGGTGA TTCTCGGAAT CATCGGTGCA 
TTCTGCCTCG CGGTAGTTGC ATTACGTCGG GGGGAGCACA TCAGCGCCCT GTGGATCGTG
GTCGCCTCTG TATCGGTGTA TCTGGTGGCG TATCGCTACT ACAGTCTGTA CATCGCCCAG
AAGGTGATGA AACTCGACCC CACGCGCGCG ACGCCTGCGG TTATTAACAA CGACGGTCTG
AACTACGTTC CGACCAACCG TTACGTGTTG TTTGGTCACC ACTTCGCCGC TATCGCCGGT
GCTGGTCCGC TGGTGGGTCC GGTTCTCGCC GCGCAGATGG GCTACCTGCC TGGCACGCTG
TGGCTGCTGG CGGGGGTCGT GCTGGCCGGT GCGGTTCAGG ACTTTATGGT GCTGTTTATC
TCCTCTCGCC GTAATGGCGC ATCTCTTGGT GAGATGATCA AAGAAGAGAT GGGACCAGTA
CCGGGGACTA TCGCGCTGTT TGGCTGTTTC TTAATCATGA TCATCATCCT CGCCGTCCTG
GCGCTGATTG TGGTTAAAGC CCTGGCCGAA AGTCCGTGGG GTGTCTTCAC CGTTTGCTCA
ACCGTACCGA TTGCGCTGTT TATGGGTATC TACATGCGCT TTATTCGTCC GGGGCGTGTG
GGTGAAGTCT CTGTCATTGG TATCGTGCTG CTGGTTGCCT CTATCTACTT CGGTGGCGTG
ATTGCTCACG ATCCGTACTG GGGTCCGGCA CTGACCTTTA AAGACACCAC CATTACCTTC
GCGCTGATTG GCTATGCGTT TGTTTCCGCA CTGCTGCCAG TGTGGCTGAT CCTCGCACCG
CGCGACTATC TGGCAACCTT CCTGAAAATC GGCGTTATCG TCGGCCTGGC GCTGGGTATC
GTGGTGCTGA ACCCGGAACT GAAAATGCCT GCCATGACCC AGTACATTGA CGGTACTGGC
CCGCTGTGGA AAGGCGCTCT GTTCCCGTTC CTGTTCATCA CCATCGCCTG TGGTGCGGTA
TCTGGCTTCC ACGCGCTGAT CTCTTCCGGT ACGACGCCAA AACTGCTGGC TAATGAAACC
GACGCGCGTT TCATCGGCTA CGGCGCAATG CTGATGGAGT CCTTCGTGGC GATTATGGCG
CTGGTTGCTG CGTCCATCAT CGAACCGGGT CTTTACTTCG CGATGAACAC CCCGCCTGCT
GGCCTTGGCA TCACCATGCC TAACCTGCAT GAAATGGGTG GCGAGAACGC GCCGATCATC
ATGGCGCAGC TGAAAGACGT TACCGCACAC GCGGCAGCGA CCGTCAGCTC CTGGGGCTTC
GTGATTTCGC CAGAGCAGAT CCTGCAAACC GCGAAAGACA TTGGTGAGCC TTCTGTCCTG
AACCGTGCAG GTGGCGCGCC GACGCTGGCG GTAGGTATCG CGCACGTGTT CCACAAAGTG
CTGCCGATGG CTGACATGGG CTTCTGGTAT CACTTCGGTA TTCTGTTCGA AGCCCTGTTC
ATCCTGACCG CGCTGGATGC GGGTACCCGT TCTGGCCGCT TTATGCTGCA AGACCTGCTG
GGTAACTTCA TCCCGTTCCT GAAAAAAACC GATTCTCTGG TTGCCGGTAT CATCGGTACT
GCGGGCTGTG TGGGTCTGTG GGGCTACCTG CTGTATCAGG GCGTGGTTGA TCCGCTGGGC
GGCGTTAAGA GCCTGTGGCC GCTGTTCGGT ATCTCCAACC AGATGCTGGC AGCCGTAGCG
CTGGTACTGG GCACCGTTGT GCTGATTAAG ATGAAGCGCA CCCAATACAT CTGGGTAACT
GTTGTTCCGG CTGTATGGCT GCTTATCTGC ACCACCTGGG CGCTGGGCCT GAAACTGTTC
AGCACCAACC CGCAGATGGA AGGCTTCTTC TACATGGCAA GCCAGTACAA AGAGAAGATT
GCTAACGGTA CTGACCTGAC GGCGCAGCAG ATTGCCAATA TGAACCACAT CGTTGTGAAC
AACTACACCA ACGCAGGTCT GAGTATTCTG TTCCTGATTG TGGTGTACAG CATCATCTTC
TACGGTTTCA AAACCTGGCT TGCGGTGCGT AACAGCGACA AACGTACTGA CAAAGAAACA
CCGTACGTTC CAATCCCGGA AGGCGGCGTG AAGATCTCTT CGCACCACTA A
 
Protein sequence
MDTKKIFKHI PWVILGIIGA FCLAVVALRR GEHISALWIV VASVSVYLVA YRYYSLYIAQ 
KVMKLDPTRA TPAVINNDGL NYVPTNRYVL FGHHFAAIAG AGPLVGPVLA AQMGYLPGTL
WLLAGVVLAG AVQDFMVLFI SSRRNGASLG EMIKEEMGPV PGTIALFGCF LIMIIILAVL
ALIVVKALAE SPWGVFTVCS TVPIALFMGI YMRFIRPGRV GEVSVIGIVL LVASIYFGGV
IAHDPYWGPA LTFKDTTITF ALIGYAFVSA LLPVWLILAP RDYLATFLKI GVIVGLALGI
VVLNPELKMP AMTQYIDGTG PLWKGALFPF LFITIACGAV SGFHALISSG TTPKLLANET
DARFIGYGAM LMESFVAIMA LVAASIIEPG LYFAMNTPPA GLGITMPNLH EMGGENAPII
MAQLKDVTAH AAATVSSWGF VISPEQILQT AKDIGEPSVL NRAGGAPTLA VGIAHVFHKV
LPMADMGFWY HFGILFEALF ILTALDAGTR SGRFMLQDLL GNFIPFLKKT DSLVAGIIGT
AGCVGLWGYL LYQGVVDPLG GVKSLWPLFG ISNQMLAAVA LVLGTVVLIK MKRTQYIWVT
VVPAVWLLIC TTWALGLKLF STNPQMEGFF YMASQYKEKI ANGTDLTAQQ IANMNHIVVN
NYTNAGLSIL FLIVVYSIIF YGFKTWLAVR NSDKRTDKET PYVPIPEGGV KISSHH