Gene EcolC_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3781 
Symbol 
ID6067637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4136871 
End bp4137893 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content57% 
IMG OID641603194 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001726713 
Protein GI170021759 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCAAT CTCTCCCGGA CACTACGCCG CCGAAAAGGC GCTTTCGCTG GCCAACGGGA 
ATGCCGCAGC TGGCGGCACT GTTGCTGGTG CTGCTGGTCG ATAGCCTGGT GGCCCCGCAT
TTCTGGCAGG TGGTGCTCCA GGATGGGCGT TTATTCGGTA GCCCCATAGA CATTCTTAAC
CGTGCGGCCC CCGTTGCGCT GTTGGCTATC GGAATGACGC TGGTGATCGC AACAGGTGGG
ATTGATCTCT CCGTGGGGGC GGTGATGGCT ATCGCCGGAG CCACAACGGC TGCGATGACG
GTCGCGGGAT TCAGCCTGCC GATTGTTTTG TTAAGCGCCC TGGGCACTGG CATCCTGGCG
GGATTGTGGA ACGGCATACT GGTAGCGATC CTCAAAATTC AGCCGTTTGT TGCCACCCTG
ATCCTGATGG TCGCCGGGCG CGGCGTGGCG CAACTGATCA CCTCCGGACA GATCGTCACG
TTTAACTCGC CGGATCTCTC ATGGTTTGGC AGTGGATCGC TGTTGTTCCT GCCAACGCCG
GTCATTATCG CGGTGCTGAC GCTTATCCTG TTCTGGCTGT TGACCCGCAA AACGGCGCTG
GGGATGTTTA TCGAAGCCGT TGGTATCAAC ATTCGGGCGG CAAAAAATGC CGGGGTAAAC
ACGCGGATCA TCGTCATGCT TACCTACGTG TTGAGCGGGC TGTGTGCGGC GATTGCGGGC
ATTATCGTGG CGGCGGATAT TCGCGGTGCC GATGCCAATA ACGCCGGGTT ATGGCTGGAG
CTGGACGCCA TTCTCGCGGT GGTTATTGGC GGCGGATCGC TGATGGGCGG ACGTTTTAAC
CTACTGCTTT CGGTGGTGGG GGCGCTGATT ATTCAGGGGA TGAACACCGG AATTTTGCTT
TCGGGCTTTC CGCCAGAGAT GAACCAGGTT GTAAAAGCGG TGGTGGTTCT TTGCGTGCTG
ATTGTCCAGT CGCAACGCTT TATCAGTCTG ATTAAAGGAG TACGTAGCCG TGATAAAACG
TAA
 
Protein sequence
MPQSLPDTTP PKRRFRWPTG MPQLAALLLV LLVDSLVAPH FWQVVLQDGR LFGSPIDILN 
RAAPVALLAI GMTLVIATGG IDLSVGAVMA IAGATTAAMT VAGFSLPIVL LSALGTGILA
GLWNGILVAI LKIQPFVATL ILMVAGRGVA QLITSGQIVT FNSPDLSWFG SGSLLFLPTP
VIIAVLTLIL FWLLTRKTAL GMFIEAVGIN IRAAKNAGVN TRIIVMLTYV LSGLCAAIAG
IIVAADIRGA DANNAGLWLE LDAILAVVIG GGSLMGGRFN LLLSVVGALI IQGMNTGILL
SGFPPEMNQV VKAVVVLCVL IVQSQRFISL IKGVRSRDKT