Gene EcolC_2996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2996 
Symbol 
ID6065904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3275570 
End bp3277021 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content48% 
IMG OID641602413 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001725948 
Protein GI170020994 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAT GTTGGCAAAT CCTCGAAATT GAAAGCACGA CGCAAATAGA CATTATCCGC 
CAGGCTTATC TTGCTCGCTT ACCGTTGTGT CATCCCGAAA CCGATCCGCA AGGGTTTAAA
GCATTACGCC AGGCCTATGA AGAGGCCCTG CGACTGGCGG TAAATCCTGT CGAGGAAGCA
GATGATGAAG AAAAAGATGC TGCCGCTGAA CATGAAATAC TACGTGCATT CAGGACATTA
CTGGATTCAG AAAGTGATCG TTTTCAGCCT TCCGCCTGGC AGAAATTTAT TCAGCAATTA
AATACCTGGA ACATGGAGGA TGTCGATCAA TTACGCTGGC CGCTGTGTGC AATCGCCATA
GAAGCGCGAT ATCTTTCATT AAATTGTGCT TCTTTGCTGG CAGAGCGTTT GAACTGGCAT
TCATTTAATG ACAGCGAAGG AATGGATGAG GAAGAAAGGG AGGCTTTTCT TGAGGCCATT
CAGGCTGGTG ATTGTTTCGA TTTCCTTAGC CTTCTGGAAT ATCCCATTGC GTTGCAGAAC
CAGACTGTTG AGTATTACTT CGCGCTGGAA CGTTGCTGCC GTTACCATCC TGACTATGTC
ACTGCGTTTT TGGCGATGGA AGGTCCGTGG TTAATTCCTG ATGATGCAAA GTTACATCGC
AAACTGTTGC GCTGGTACAG CTCGGTGCAA ACAGGTATGG CGGAACTCAT TCCTGTCGCT
CAACAGTGGC AAACGGAAGA ACCAGAAAGC GAAGATGCCC GGTATTACTT GTGTGCACAA
CGTTTGTACT GCGGCGAGGG GGAAAGCCTG CTTGCCGATC TCTGCGCGTA CTGGGAAAGT
TACCCATCTA CACAAGCTGA TAATTTGTTG TTGCAGTGGA GCAAGCGTCA TTGCCCGGAT
TATTTCGCGT TATTAGTGAT GGTTATCGAA GCGCGGAGCA TGGTAGATGC GCAAGGTCAA
CCGCTGAAAT ATGTTCCTGG TGAGAGCGCC CGGACGCGGC TGTTATGGGC GGAGATTTTA
CATAGCGGAA AATTATCGCC GTTAGGTCAA TCGTTTATTG AGTCGTTATT CTTCAAGCGC
AAAGCATGGG CGTGGTGGAA ATCGAGAGTC GGTAGCGAGA CAGAGCAAGA TTCACCGTTC
CTGGATTTGT ATCGGGTAGC GGAACAGGTA GTACTTGAAG CGTTTCCGAA ACAAGAGATG
CTGGCCCGTC TTAATACAAG GCTGGAAGGC GGAGATGCTC ATCCATTAGA GGCCATTGTC
ACCCGGATGC TTTTGACGAA AGTGAAACTC GAGCCGGAGG ATGAAGATGT CGATGAGCCA
ACACCTGAAA ATCATGAAGA AAAAAATGAT GAGGGTGAAA AACCACAGAG CATTACCAGC
ATTATCAAAA TCAGTTTAAC GGTGCTGGTG ATAGGTTATG CTCTCGGCAA AATCGCGATG
TTGTTTAGCT GA
 
Protein sequence
MKTCWQILEI ESTTQIDIIR QAYLARLPLC HPETDPQGFK ALRQAYEEAL RLAVNPVEEA 
DDEEKDAAAE HEILRAFRTL LDSESDRFQP SAWQKFIQQL NTWNMEDVDQ LRWPLCAIAI
EARYLSLNCA SLLAERLNWH SFNDSEGMDE EEREAFLEAI QAGDCFDFLS LLEYPIALQN
QTVEYYFALE RCCRYHPDYV TAFLAMEGPW LIPDDAKLHR KLLRWYSSVQ TGMAELIPVA
QQWQTEEPES EDARYYLCAQ RLYCGEGESL LADLCAYWES YPSTQADNLL LQWSKRHCPD
YFALLVMVIE ARSMVDAQGQ PLKYVPGESA RTRLLWAEIL HSGKLSPLGQ SFIESLFFKR
KAWAWWKSRV GSETEQDSPF LDLYRVAEQV VLEAFPKQEM LARLNTRLEG GDAHPLEAIV
TRMLLTKVKL EPEDEDVDEP TPENHEEKND EGEKPQSITS IIKISLTVLV IGYALGKIAM
LFS