Gene EcHS_A3442 details


Gene Information

Locus tag: EcHS_A3442
Symbol:
ID: 5594583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3444980
End bp: 3446920
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 51%
IMG OID: 640922560
Product: regulatory protein CsrD
Protein accession: YP_001460048
Protein GI: 157162730
COG category: [T] Signal transduction mechanisms
COG ID: [COG2200] FOG: EAL domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
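
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is the convention under which the listed gene length is reproduced) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3444980
end_bp = 3446920

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1941 0 646
```

Under these assumptions the result agrees with the listed gene length of 1941 bp and protein length of 646 aa.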


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000473717
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTAA CGACGAAATT TTCGGCCTTT GTTACGCTGC TCACCGGGTT AACAATTTTT 
GTGACTTTGC TGGGCTGTTC GCTAAGTTTC TACAACGCCA TTCAGTATAA GTTTAGTCAT
CGCGTTCAGG CGGTGGCGAC GGCGATTGAT ACCCACCTTG TGTCGAATGA CTTCAGCGTA
TTAAGGCCAC AAATTACCGA ATTAATGATG TCGGCAGATA TCGTTCGTGT AGACCTGCTC
CATGGTGATA AACAGGTTTA TACCCTGGCC AGAAATGGTA GTTATCGTCC AGTTGGCTCC
AGCGATCTGT TTCGCGAACT GAGCGTTCCG TTGATAAAGC ATCCGGGGAT GTCGTTGCGT
CTGGTTTATC AGGATCCGAT GGGCAACTAT TTCCATTCGT TGATGACCAC CGCGCCGCTC
ACGGGGGCGA TTGGCTTTAT CATTGTTATG CTCTTCCTGG CGGTACGCTG GTTACAACGG
CAACTTGCCG GGCAAGAATT GCTGGAAACC CGGGCTACTC GTATCTTAAA CGGTGAGCGT
GGCTCTAATG TGTTGGGAAC CATCTATGAA TGGCCGCCCA GAACCAGCAG TGCGCTGGAT
ACGCTGCTTC GTGAAATTCA GAACGCACGC GAACAACACA GCCGTCTTGA TACGCTGATC
CGCTCTTATG CCGCCCAGGA CGTGAAAACC GGCCTCAATA ACCGACTCTT TTTCGATAAT
CAGTTAGCAA CGTTACTGGA AGATCAGGAG AAAGTAGGTA CCCACGGGAT CGTGATGATG
ATTCGTCTGC CGGATTTCAA TATGTTGAGC GATACCTGGG GGCACAGCCA GGTTGAAGAA
CAGTTCTTCA CTCTGACGAA TCTGCTGTCG ACATTTATGA TGCGCTACCC TGGCGCACTG
CTGGCGCGTT ACCACCGCAG TGATTTTGCT GCGCTGTTAC CGCACCGGAC GTTAAAAGAG
GCAGAGAGCA TCGCCGGTCA GTTAATCAAA GCCGTTGATA CCTTGCCGAA CAATAAAATG
CTCGATCGCG ACGATATGAT CCACATTGGT ATCTGCGCCT GGCGTAGTGG TCAGGATACC
GAGCAGGTAA TGGAACATGC AGAGTCTGCC ACGCGTAATG CGGGATTGCA GGGCGGCAAT
AGCTGGGCTA TTTACGATGA CTCGTTGCCT GAAAAAGGAC GCGGTAATGT TCGCTGGCGT
ACGCTTATCG AGCAAATGCT CAGTCGCGGC GGCCCGCGCC TTTATCAAAA ACCGGCGGTT
ACTCGCGAAG GTCAGGTTCA TCATCGCGAA CTCATGTGCC GCATCTTCGA TGGTAATGAA
GAGGTTAGCT CGGCGGAGTA TATGCCGATG GTCTTGCAGT TTGGCTTATC GGAAGAGTAT
GACCGTCTGC AAATCAGCCG TCTTATTCCA CTATTGCGTT ACTGGCCAGA GGAAAATCTG
GCGATTCAGG TTACCGTTGA GTCGCTGATT CGCCCGCGTT TTCAGCGTTG GCTGCGCGAT
ACGTTAATGC AATGTGAAAA ATCACAACGA AAACGCATAA TTATTGAACT TGCAGAGGCC
GATGTAGGTC AACATATCAG TCGTTTACAA CCTGTTATTC GTTTAGTGAA TGCTTTAGGG
GTACGGGTAG CCGTCAACCA GGCTGGTTTG ACGCTGGTAA GTACCAGTTG GATCAAAGAA
CTTAATGTTG AGTTACTCAA GCTCCATCCG GGGCTGGTCA GAAACATTGA GAAGCGAACG
GAGAACCAGC TGCTGGTTCA AAGCCTGGTG GAAGCCTGCT CCGGGACCAG CACCCAGGTT
TACGCCACCG GCGTGCGTTC GCGAAGCGAG TGGCAGACCC TGATTCAGCG CGGTGTTACA
GGCGGGCAAG GGGATTTTTT CGCGTCCTCA CAGCCACTTG ATACTAACGT GAAAAAATAT
TCACAAAGAT ACTCGGTTTA A
 
Protein sequence
MRLTTKFSAF VTLLTGLTIF VTLLGCSLSF YNAIQYKFSH RVQAVATAID THLVSNDFSV 
LRPQITELMM SADIVRVDLL HGDKQVYTLA RNGSYRPVGS SDLFRELSVP LIKHPGMSLR
LVYQDPMGNY FHSLMTTAPL TGAIGFIIVM LFLAVRWLQR QLAGQELLET RATRILNGER
GSNVLGTIYE WPPRTSSALD TLLREIQNAR EQHSRLDTLI RSYAAQDVKT GLNNRLFFDN
QLATLLEDQE KVGTHGIVMM IRLPDFNMLS DTWGHSQVEE QFFTLTNLLS TFMMRYPGAL
LARYHRSDFA ALLPHRTLKE AESIAGQLIK AVDTLPNNKM LDRDDMIHIG ICAWRSGQDT
EQVMEHAESA TRNAGLQGGN SWAIYDDSLP EKGRGNVRWR TLIEQMLSRG GPRLYQKPAV
TREGQVHHRE LMCRIFDGNE EVSSAEYMPM VLQFGLSEEY DRLQISRLIP LLRYWPEENL
AIQVTVESLI RPRFQRWLRD TLMQCEKSQR KRIIIELAEA DVGQHISRLQ PVIRLVNALG
VRVAVNQAGL TLVSTSWIKE LNVELLKLHP GLVRNIEKRT ENQLLVQSLV EACSGTSTQV
YATGVRSRSE WQTLIQRGVT GGQGDFFASS QPLDTNVKKY SQRYSV
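
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that layout and translates DNA into protein. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. The codon table is built from the standard genetic code; translation table 11 is understood to share the same codon-to-amino-acid assignments and to differ only in which codons may serve as starts, so this sketch does not treat alternative start codons specially.

```python
from itertools import product

# Standard codon table, with bases in T, C, A, G order.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as printed in this record.
first_line = "ATGAGATTAA CGACGAAATT TTCGGCCTTT GTTACGCTGC TCACCGGGTT AACAATTTTT"
print(translate(clean(first_line)))  # MRLTTKFSAFVTLLTGLTIF
```

The 60 bases on the first line translate to the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.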