Gene EcolC_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1642 
Symbol 
ID6064683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1821691 
End bp1823061 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content47% 
IMG OID641601056 
ProductPTS system galactitol-specific IIC component 
Protein accessionYP_001724626 
Protein GI170019672 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000536368 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACTGA TCACGCAATT TATAAACGAT CTGGGAAATT TTATATTTAT CCCGGTCATC 
TTTCTGGTAC TGATGAAGAT ACTTGGCCGT CCTCTTTCAG AATGTATCTC ATCTGCCATC
AAAGTCGGCA TTGGTTTCAT TGCGTTAACC ATGACCATCA AACTGATGCT GGAAAAAATG
CAACCGGCAG TCACCGGATT AGCAGAAGCA ACAGGTTCCT CGCTCAGTGC CATCGATGTT
GGTGGCGCAG CGACTGCGGT TATGGGATTT GGCTCCAGCA TGGGCGCTAT CATTATTCCC
CTCTGTGTTG CGGTAAATAT TGCAATGCTG GTCGCCCGCC TGACTGACTG TGTTAACGTT
GATGTTTTCA ACCTTCATCA AAATGCGTCA ATGGGGGCAA TTGTTGGCGT CTATTCTGGT
AGCTTCCTGT ATGGCGCATT GACCGCCGCG CTATTCCATG TATGGGCGCT GATCGCTGCC
GATCTTGGTG CTAAAAATAA CGAAAAATTC TTTAACCTGC CAAAAGGTGT TGCGATCTCT
CACCCGGTTG CCAATACCTA CTTACTTTTC GCTTATCCAT TCAACTGGAT TTATGATCGC
ATCCCAGGCT TCCGTAATCT GAATGTGACC GCCGAAACTA TTCAAAAACG GTTTGGCATT
CTCGGCGATC CAACTATGGT TGGTTTTATT ATTGGTATTT TGTTGGGCTT TTGTGGTTAT
GGCTGGAAAT CCCCATACCA CACCATAATC GCCAGCCTAC AGTTAGGGAT GTATCTTGCT
GCAGTCATGC TTCTGTTGCC ACGTATGACC TCTATCATGA TGGAAGGGCT TGTTCCGCTT
TCCAACGTAG CACGCAAAAA ACTGGTCAAA CGTTTCCCGG ATCGTCAAAT CACTGTTGGT
ATGGACACTG CTCTGATTGT GGGCCATCCA TCAGTTATCG CCCCTGCATT ATTGCTGATC
CCGGTGATTG TGATCCTCGC CGTGATCTTG CCCGGCAACC GCGTTATGCC ACTGGGTGAT
CTCTCTCAGT TTGTGTTTTT CATTGCCTGC ATGGTACCTG TTTTCAATGG CAACATTATT
CGCACCTGGG TGACCTCGAT CATTTTGTTT GGTGGTGGTT TGTATATTGC ATCATGGATG
GCACCGGCTA CCAACGAAGT CTTCCAGAAG TTTGGTACAA ACCCGGATGC CAGCGTGATG
TACTCTTCGC TTAACCCGTC AGCGAATCCA TTTACTGGTC TGTTTGCCGC CCTGAGCCAT
GTTGGAATCA TTGGCTATGT GATGGCAGGT ATCCTTTTGT TATCTATTGG ATACTTAATT
AAACAAAAAT CACGTCGCCA GATTGAAACG GATTTGGAAA AAGCGCTTTA A
 
Protein sequence
MELITQFIND LGNFIFIPVI FLVLMKILGR PLSECISSAI KVGIGFIALT MTIKLMLEKM 
QPAVTGLAEA TGSSLSAIDV GGAATAVMGF GSSMGAIIIP LCVAVNIAML VARLTDCVNV
DVFNLHQNAS MGAIVGVYSG SFLYGALTAA LFHVWALIAA DLGAKNNEKF FNLPKGVAIS
HPVANTYLLF AYPFNWIYDR IPGFRNLNVT AETIQKRFGI LGDPTMVGFI IGILLGFCGY
GWKSPYHTII ASLQLGMYLA AVMLLLPRMT SIMMEGLVPL SNVARKKLVK RFPDRQITVG
MDTALIVGHP SVIAPALLLI PVIVILAVIL PGNRVMPLGD LSQFVFFIAC MVPVFNGNII
RTWVTSIILF GGGLYIASWM APATNEVFQK FGTNPDASVM YSSLNPSANP FTGLFAALSH
VGIIGYVMAG ILLLSIGYLI KQKSRRQIET DLEKAL