Gene EcolC_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2142 
Symbol 
ID6066233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2337794 
End bp2338816 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content49% 
IMG OID641601550 
Productsugar ABC transporter periplasmic subunit 
Protein accessionYP_001725109 
Protein GI170020155 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTTC ATCGCTTTAA GAAAATCGCC TTACTTAGCG TTCTTGGCAT TGCCGCAATC 
TCTATGAATG TGCAGGCCGC AGAGCGTATT GCATTTATTC CCAAACTGGT TGGCGTGGGA
TTTTTTACCA GCGGTGGCAA CGGCGCACAA CAAGCGGGTA AAGAGCTGGG CGTTGATGTG
ACCTACGACG GGCCGACAGA ACCCAGTGTT TCTGGTCAGG TACAGTTGAT TAATAACTTC
GTCAATCAAG GTTATAACGC CATTATCGTT TCTGCGGTTT CGCCTGATGG CTTGTGTCCG
GCACTGAAAC GCGCCATGCA ACGTGGTGTG AGAGTGCTGA CCTGGGACTC TGATACTAAA
CCGGAGTGCC GCTCTTACTA CATTAATCAG GGAACGCCCG CCCAGTTGGG AGGTATGTTG
GTGGATATGG CGGCGCGTCA GGTGAATAAA GACAAAGCCA AAGTCGCGTT TTTCTACTCA
AGCCCCACCG TTACGGACCA AAACCAGTGG GTGAAAGAAG CGAAAGCGAA AATCGCCAAA
GAGCATCCTG GCTGGGAAAT TGTCACTACG CAGTTTGGCT ATAACGATGC CACTAAATCA
TTACAAACCG CAGAAGGAAT ATTAAAAGCG TATAGCGATC TCGACGCCAT TATCGCCCCC
GATGCCAACG CCCTGCCCGC TGCCGCACAA GCCGCAGAAA ACTTGAAAAA TGACAAAGTA
GCGATTGTCG GATTCAGTAC GCCAAACGTG ATGCGTCCAT ATGTGGAACG CGGCACGGTG
AAAGAATTTG GCCTGTGGGA TGTGGTTCAG CAAGGCAAAA TTTCAGTGTA TGTCGCGGAT
GCATTATTGA AAAAAGGATC AATGAAAACG GGCGACAAGC TGGATATCCA GGGCGTAGGT
CAGGTTGAAG TCTCGCCAAA TAGCGTTCAG GGCTATGACT ACGAAGCGGA TGGTAATGGC
ATCGTACTGT TACCGGAGCG CGTGATATTC AACAAAGAGA ATATCGGCAA ATACGATTTC
TGA
 
Protein sequence
MTLHRFKKIA LLSVLGIAAI SMNVQAAERI AFIPKLVGVG FFTSGGNGAQ QAGKELGVDV 
TYDGPTEPSV SGQVQLINNF VNQGYNAIIV SAVSPDGLCP ALKRAMQRGV RVLTWDSDTK
PECRSYYINQ GTPAQLGGML VDMAARQVNK DKAKVAFFYS SPTVTDQNQW VKEAKAKIAK
EHPGWEIVTT QFGYNDATKS LQTAEGILKA YSDLDAIIAP DANALPAAAQ AAENLKNDKV
AIVGFSTPNV MRPYVERGTV KEFGLWDVVQ QGKISVYVAD ALLKKGSMKT GDKLDIQGVG
QVEVSPNSVQ GYDYEADGNG IVLLPERVIF NKENIGKYDF