Gene EcolC_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2999 
Symbol 
ID6065931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3278391 
End bp3279818 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content48% 
IMG OID641602416 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001725951 
Protein GI170020997 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATT GCTGGAAGAT CCTCGATATA GAGGAAACGA CTGACGTCGA TATTATCCGC 
CGCGCTTATC TGGCGCTGTT ACCGTCCTTT CATCCAGAAA CCGATCCGCA GGGTTTTAAA
CAACTTCGTC AGGCGTATGA GGAGGCGCTA CGGATTGCGC AGTCGCCTGC TAAATCTGTT
TGGCAACCAG AAGAATATGA GGTAGCAGAA CATGAAATTC TGCTCGCCTT TCGTGCGTTA
CTTGCCTCTG ATAGTGAACG TTTTCTGCCC TCCGCCTGGC AGCGATTCAT TCAGCAATTA
AATTATTGCT CGATGGAGGA GATTGATGAA TTACGCTGGT CGCTGTGCAC AATCGCCATG
AACACTGCCC ATTTATCCTT CGAGTGCGTG GTGTTATTAG CAGAAAGATT GCGGTGGTTA
CAGGAGGAAA ATACCGGGGA AATAGACGAA GAAGAACTGG AATCCTTTTT ATATGCCATT
GCGAAGGGGA ATGTTTTTAA CTTCCAGACC ATTCTGCATC TGCCCGTTGC CGTGCAAAAT
GACACCATCG ATTTTTACCA AATGTTCGCT CGGATTTGGT CATCGCATCC AGAATGGCTG
ACATTGTATT TAGCGCAACA TCGCGCAGTG ATTATCCCCG ATGATGCAAA ACTGCACAGA
AATTTACTCC GCTGGTATAG CGCAGGTCGC CTGGATATCC CCGAACTTCT GGATTACGCC
CAGTCATGGC GGGAAACTGA ACCTGATAAT GAAGATGCGC CTTATTATGA ATACGCGCAA
CGCGTCTATT GTGGAGAAGG CGAAAGCCTG TTGGCAGAAC TTTGTGACTA CTGGCGCGAG
TATCCCTCCA CCCAGGCGGA TGCTTTAATG TTGCAATGGT GCCGTCAGCA TCGGGTCGAT
TATTACCCAT TACTGGTGAT GATGATTGAA GCGCGTGATC TGGTTAACGA TCAGGGAAAA
CCGCTACTTT ATGTCCCCGG CGACAGCGCC CGTACGCGTT TTCATTTATA CGAAATACTC
AGCGATGAAA AACTCTCTGC GCTGGGGCGT TCACTGGTCG AGATGGTTTT GCACAAAGGA
CGTAAGCCGC GGCTCTCACT CACGCGTGAT ACAGAACATA CCTTATGGCC ATTATATCTA
GTTGCCAAAC AATTAGTGCA GGCCTGCCAA CCTACAGAAG AATCATTAAT GCCGATTGTG
AGCCGCCTTG ATGCAGAAAA TCGTTGTCCA CTGGAAGCAT TAATTATTCG TCGATTATTA
ATTCAGGCGG CGAATTTTAC CGAGAAGCAA ACTGTTGAAC CGGAGCCGCA ACCGCAGCCA
ATGCCCGTTG ACGATGGTGG GCCAGGCTGT CTGGGCATCA TTAAAATTAT TTTCTATATT
TTTATCTTTG CTGGTTTGAT AGGGAAAATA CTCCATCTGT TCGGGTGA
 
Protein sequence
MKNCWKILDI EETTDVDIIR RAYLALLPSF HPETDPQGFK QLRQAYEEAL RIAQSPAKSV 
WQPEEYEVAE HEILLAFRAL LASDSERFLP SAWQRFIQQL NYCSMEEIDE LRWSLCTIAM
NTAHLSFECV VLLAERLRWL QEENTGEIDE EELESFLYAI AKGNVFNFQT ILHLPVAVQN
DTIDFYQMFA RIWSSHPEWL TLYLAQHRAV IIPDDAKLHR NLLRWYSAGR LDIPELLDYA
QSWRETEPDN EDAPYYEYAQ RVYCGEGESL LAELCDYWRE YPSTQADALM LQWCRQHRVD
YYPLLVMMIE ARDLVNDQGK PLLYVPGDSA RTRFHLYEIL SDEKLSALGR SLVEMVLHKG
RKPRLSLTRD TEHTLWPLYL VAKQLVQACQ PTEESLMPIV SRLDAENRCP LEALIIRRLL
IQAANFTEKQ TVEPEPQPQP MPVDDGGPGC LGIIKIIFYI FIFAGLIGKI LHLFG