Gene ECD_01097 details


Gene Information

Locus tag: ECD_01097
Symbol: ptsG
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1160436
End bp: 1161869
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: fused glucose-specific PTS enzymes: IIB component/IIC component
Protein accession: ACT42992
Protein GI: 253977322
COG category:
COG ID:
TIGRFAM ID:
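
The length fields above follow from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1160436, 1161869)
print(length)                     # 1434
print(protein_length_aa(length))  # 477
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.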


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0550337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA 
TCCGTACTGC CTATCGCAGG TATTCTGCTG GGCGTCGGTT CCGCGAATTT CAGCTGGCTG
CCCGCCGTTG TATCGCATGT TATGGCAGAA GCAGGCGGTT CCGTCTTTGC AAACATGCCA
CTGATTTTTG CGATCGGTGT CGCCCTCGGC TTTACCAATA ACGATGGCGT ATCCGCGCTG
GCCGCAGTTG TTGCCTATGG CATCATGGTT AAAACCATGG CCGTGGTTGC GCCACTGGTA
CTGCATTTAC CTGCTGAAGA AATCGCCTCT AAACACCTGG CGGATACTGG CGTACTCGGA
GGGATTATCT CCGGTGCGAT CGCAGCGTAC ATGTTTAACC GTTTCTACCG TATTAAGCTG
CCTGAGTATC TTGGCTTCTT TGCCGGTAAA CGCTTTGTGC CGATCATTTC TGGCCTGGCT
GCCATCTTTA CTGGCGTTGT GCTGTCCTTC ATTTGGCCGC CGATTGGTTC TGCAATCCAG
ACCTTCTCTC AGTGGGCTGC TTACCAGAAC CCGGTAGTTG CGTTTGGCAT TTACGGTTTC
ATCGAACGTT GCCTGGTACC GTTTGGTCTG CACCACATCT GGAACGTACC TTTCCAGATG
CAGATTGGTG AATACACCAA CGCAGCAGGT CAGGTTTTCC ACGGCGACAT TCCGCGTTAT
ATGGCGGGTG ACCCGACTGC GGGTAAACTG TCTGGTGGCT TCCTGTTCAA AATGTACGGT
CTGCCAGCTG CCGCAATTGC TATCTGGCAC TCTGCTAAAC CAGAAAACCG CGCGAAAGTG
GGCGGTATTA TGATCTCCGC GGCGCTGACC TCGTTCCTGA CCGGTATCAC CGAGCCGATC
GAGTTCTCCT TCATGTTCGT TGCGCCGATC CTGTACATCA TCCACGCGAT TCTGGCAGGC
CTGGCATTCC CAATCTGTAT TCTTCTGGGG ATGCGTGACG GTACGTCGTT CTCGCACGGT
CTGATCGACT TCATCGTTCT GTCTGGTAAC AGCAGCAAAC TGTGGCTGTT CCCGATCGTC
GGTATCGGTT ATGCGATTGT TTACTACACC ATCTTCCGCG TGCTGATTAA AGCACTGGAT
CTGAAAACGC CGGGTCGTGA AGACGCGACT GAAGATGCAA AAGCGACAGG TACCAGCGAA
ATGGCACCGG CTCTGGTTGC TGCATTTGGT GGTAAAGAAA ACATTACTAA CCTCGACGCA
TGTATTACCC GTCTGCGCGT CAGCGTTGCT GATGTGTCTA AAGTGGATCA GGCCGGCCTG
AAGAAACTGG GCGCAGCGGG CGTAGTGGTT GCTGGTTCTG GTGTTCAGGC GATTTTCGGT
ACTAAATCCG ACAACCTGAA AACCGAGATG GATGAGTACA TCCGTAACCA CTAA
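
A GC content figure like the one in the record can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as shown, in space-separated blocks of 10 bases; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of `first_row` gives the gene-wide value to compare against the GC content field.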
 
Protein sequence
MFKNAFANLQ KVGKSLMLPV SVLPIAGILL GVGSANFSWL PAVVSHVMAE AGGSVFANMP 
LIFAIGVALG FTNNDGVSAL AAVVAYGIMV KTMAVVAPLV LHLPAEEIAS KHLADTGVLG
GIISGAIAAY MFNRFYRIKL PEYLGFFAGK RFVPIISGLA AIFTGVVLSF IWPPIGSAIQ
TFSQWAAYQN PVVAFGIYGF IERCLVPFGL HHIWNVPFQM QIGEYTNAAG QVFHGDIPRY
MAGDPTAGKL SGGFLFKMYG LPAAAIAIWH SAKPENRAKV GGIMISAALT SFLTGITEPI
EFSFMFVAPI LYIIHAILAG LAFPICILLG MRDGTSFSHG LIDFIVLSGN SSKLWLFPIV
GIGYAIVYYT IFRVLIKALD LKTPGREDAT EDAKATGTSE MAPALVAAFG GKENITNLDA
CITRLRVSVA DVSKVDQAGL KKLGAAGVVV AGSGVQAIFG TKSDNLKTEM DEYIRNH
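
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino-acid assignments of translation table 11, which match the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The `translate` function is illustrative, not the tool used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTTAAGA ATGCATTTGC TAACCTGCAA AAGGTCGGTA AATCGCTGAT GCTGCCGGTA"
print(translate(first_row))  # MFKNAFANLQKVGKSLMLPV
```

The output matches the first two 10-residue blocks of the protein sequence; the same call on the full gene sequence can be compared with the full protein sequence.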