Gene EcolC_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2145 
Symbol 
ID6066507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2340842 
End bp2342377 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content54% 
IMG OID641601553 
ProductABC transporter related 
Protein accessionYP_001725112 
Protein GI170020158 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACGA GTGATACCCG CGCGTTACCG CTACTTTGCG CCCGCTCGGT TTATAAACAG 
TATTCAGGGG TCAATGTCCT GAAAGGCATC GATTTTACGT TGCATCAGGG GGAGGTCCAC
GCCCTGCTCG GCGGCAATGG TGCCGGTAAA TCGACGTTAA TGAAGATTAT TGCCGGTATT
ACCCCTGCTG ATAGCGGTAC GCTGGAGATT GGGGGCAACA ACTACGCCAG ATTAACGCCA
GTTCATGCTC ATCAGCTGGG TATTTATCTC GTTCCCCAGG AACCGCTGCT TTTCCCAAGC
CTGTCGATAA AAGAAAACAT CCTGTTTGGG CTGGCAAAAA AACAGCTCTC CATGCAGAAA
ATGAAGAACT TGCTGGCGGC GCTGGGCTGC CAGTTTGATC TGCATAGTCT GGCAGGATCG
CTGGATGTCG CCGATCGCCA AATGGTGGAA ATCCTCCGCG GGCTGATGCG CGACTCGCGG
ATTCTGATCC TCGATGAACC TACCGCCTCG CTTACCCCTG CGGAAACCGA ACGCTTGTTT
ACTCGCTTGC AAGAGCTGCT TGCTACTGGC GTGGGTATTG TTTTTATCTC GCATAAGCTG
CCGGAAATTC GCCAGATTGC CGATCGAATT AGCGTGATGC GCGACGGAAC CATCGCCTTA
AGCGGCAAAA CCAGCGAACT GTCTACCGAC GACATTATTC AGGCCATCAC GCCAGTGGTA
CGGGAAAAAT CGCTCTCTGC CAGCCAAAAA TTATGGCTGG AGTTACCTGG TAACCGCCCA
CAACATGCCG CCGGAACGCC GGTGCTGACA CTGGAAAATC TGACTGGCGA AGGCTTCAGG
AATGTCAGCC TGACGCTCAA TGCCGGAGAA ATTCTGGGCC TGGCTGGGCT GGTGGGAGCC
GGACGCACAG AACTGGCCGA GACGCTCTAT GGTCTGCGTA CTTTGCGTGG CGGACGCATT
ATGCTGAATG GTAACGAGAT CAATAAATTA TCCACTGGAG AACGTTTACT GCGCGGTCTG
GTTTATCTGC CGGAAGATCG CCAGTCATCC GGACTGAATC TCGATGCTTC GCTGGCCTGG
AATGTCTGCG CCCTTACTCA TAACCTTCGT GGATTCTGGG CGAAAACCGC GAAAGATAAT
GCCACCCTGG AACGATATCG TCGGGCGCTG AATATTAAAT TTAACCAACC GGAACAAGCT
GCACGGACAT TATCCGGTGG CAACCAGCAA AAAATCCTCA TTGCCAAATG CCTGGAAGCC
TCGCCGCAAG TATTGATTGT CGATGAGCCG ACGCGCGGCG TGGATGTCTC GGCGCGTAAT
GATATCTACC AACTGTTGCG CAGCATCGCC GCACAGAATG TGGCTGTGCT GCTTATCTCC
TCCGACCTGG AAGAGATCGA ACTGATGGCA GATCGCGTGT ATGTGATGCA TCAGGGCGAA
ATTGCCCACT CTGCACTGAC CGGGCGCGAT ATTAATGTCG AGACTATTAT GCGCGTTGCC
TTCGGCGATA GTCAGCGTCA GGAGGCGTCA TGCTGA
 
Protein sequence
MQTSDTRALP LLCARSVYKQ YSGVNVLKGI DFTLHQGEVH ALLGGNGAGK STLMKIIAGI 
TPADSGTLEI GGNNYARLTP VHAHQLGIYL VPQEPLLFPS LSIKENILFG LAKKQLSMQK
MKNLLAALGC QFDLHSLAGS LDVADRQMVE ILRGLMRDSR ILILDEPTAS LTPAETERLF
TRLQELLATG VGIVFISHKL PEIRQIADRI SVMRDGTIAL SGKTSELSTD DIIQAITPVV
REKSLSASQK LWLELPGNRP QHAAGTPVLT LENLTGEGFR NVSLTLNAGE ILGLAGLVGA
GRTELAETLY GLRTLRGGRI MLNGNEINKL STGERLLRGL VYLPEDRQSS GLNLDASLAW
NVCALTHNLR GFWAKTAKDN ATLERYRRAL NIKFNQPEQA ARTLSGGNQQ KILIAKCLEA
SPQVLIVDEP TRGVDVSARN DIYQLLRSIA AQNVAVLLIS SDLEEIELMA DRVYVMHQGE
IAHSALTGRD INVETIMRVA FGDSQRQEAS C