Gene EcolC_3783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3783 
Symbol 
ID6066398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4139549 
End bp4140505 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content50% 
IMG OID641603196 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001726715 
Protein GI170021761 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAC GCTTACTTGT AGTCTCTGCA GTCTCGGCAG CCATGTCGTC TATGGCGTTG 
GCCGCTCCAT TAACCGTAGG ATTTTCGCAG GTCGGATCGG AATCTGGCTG GCGCGCCGCA
GAAACCAATG TGGCGAAAAG TGAGGCCGAA AAACGCGGAA TTACGCTGAA AATTGCCGAT
GGTCAGCAAA AGCAGGAAAA CCAGATTAAA GCGGTACGTT CCTTCGTCGC GCAAGGGGTG
GATGCGATCT TTATCGCTCC GGTGGTTGCG ACTGGTTGGG AACCGGTATT AAAAGAGGCG
AAAGATGCCG AAATCCCGGT CTTCTTGCTC GATCGTTCTA TTGATGTGAA AGACAAATCT
CTCTATATGA CCACCGTCAC TGCCGACAAC ATTCTCGAAG GCAAGTTGAT TGGTGACTGG
CTGGTAAAAG AAGTGAATGG CAAACCATGC AACGTGGTGG AGCTGCAGGG CACTGTTGGG
GCCAGCGTCG CCATTGACCG TAAGAAAGGC TTTGCCGAAG CCATTAAGAA TGCGCCAAAT
ATCAAAATTA TCCGCTCGCA GTCAGGTGAC TTCACCCGCA GTAAAGGCAA AGAAGTCATG
GAGAGCTTTA TCAAAGCGGA AAACAACGGC AAAAACATCT GCATGGTTTA CGCCCATAAC
GACGACATGG TGATTGGTGC AATTCAGGCA ATTAAAGAAG CGGGCCTGAA ACCAGGCAAA
GATATTCTGA CAGGTTCTAT CGACGGCGTA CCGGATATCT ACAAAGCGAT GATTGATGGC
GAAGCGAACG CCAGTGTTGA ACTGACGCCG AATATGGCAG GTCCCGCCTT CGACGCGCTG
GAGAAATACA AAAAAGACGG CACCATGCCT GAAAAGCTGA CGCTGACCAA ATCCACCCTT
TATCTGCCTG ATACCGCAAA AGAAGAGTTA GAGAAGAAGA AAAATATGGG GTATTGA
 
Protein sequence
MWKRLLVVSA VSAAMSSMAL AAPLTVGFSQ VGSESGWRAA ETNVAKSEAE KRGITLKIAD 
GQQKQENQIK AVRSFVAQGV DAIFIAPVVA TGWEPVLKEA KDAEIPVFLL DRSIDVKDKS
LYMTTVTADN ILEGKLIGDW LVKEVNGKPC NVVELQGTVG ASVAIDRKKG FAEAIKNAPN
IKIIRSQSGD FTRSKGKEVM ESFIKAENNG KNICMVYAHN DDMVIGAIQA IKEAGLKPGK
DILTGSIDGV PDIYKAMIDG EANASVELTP NMAGPAFDAL EKYKKDGTMP EKLTLTKSTL
YLPDTAKEEL EKKKNMGY