Gene Ent638_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2021 
Symbol 
ID5113437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2193710 
End bp2195080 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content55% 
IMG OID640492209 
ProductPTS system, sucrose-specific IIBC subunit 
Protein accessionYP_001176748 
Protein GI146311674 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR00852] PTS system, maltose and glucose-specific subfamily, IIC component
[TIGR01996] PTS system, sucrose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.139063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0932593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCG ATAAAATTGC CCACGCGCTG ATTCCGCTGC TCGGCGGAAA AGAGAACATT 
GCCAGCGCGG CGCACTGTGC CACGCGTTTA CGGCTAGTCC TGGTCGATGA TGCGTTGGCT
GACCAGCAGG CGATCGGCAA GGTTGACGGG GTCAAAGGGT GTTTTCGCAA CGCGGGGCAA
ATGCAGATTA TTTTCGGCAC CGGCGTGGTG AATAAAGTCT ATGCGGCGTT CATTCAGGCC
ACTGGCATAG GCGAATCCAC CAAGTCTGAA GCCGCCGACA TCGCGGCGCG TAAGCTGAAT
CCGTTCCAGC GGATCGCGCG TCTGCTGTCG AATATTTTCG TGCCTATTAT TCCTGCCATC
GTCGCATCGG GACTGCTGAT GGGCTTGCTG GGGATGGTAA AAACTTACGG TTGGGTGAAT
GCCGATAACG CGATCTACAT CATGCTCGAC ATGTGCAGTT CTGCGGCGTT TATCATTTTG
CCGATCCTGA TTGGTTTTAC CGCCGCCCGT GAATTTGGCG GCAACCCGTA CCTGGGCGCG
ACGCTTGGCG GCATCCTGAC GCATCCGGCG TTGACCAACG CCTGGGGGGT GGCATCGGGC
TTCCATACCA TGGACTTTTT TGGCCTCGAC ATTGCCATGA TCGGTTACCA GGGCACGGTA
TTCCCGGTGC TGTTGGCTGT GTGGTTTATG AGCCATGTTG AAAAACAACT CCGCCGCGTG
ATCCCGGATG CACTGGACTT AATTCTCACG CCGTTTTTCA CCGTGATTAT TTCCGGCTTT
ATTGCGTTGC TGATTATTGG CCCGGCGGGG CGCGCACTGG GCGACGGTAT CTCCTTCGTA
CTGAGCACGC TGATTGAACA TGCCGGTTGG CTGGCAGGGC TGTTATTCGG TGGCCTGTAC
TCGGTGATTG TCATCACCGG CATTCACCAC AGTTTCCACG CTATCGAAGC GGGATTGCTG
GGAAATCCGT CGATTGGTGT TAACTTCCTG CTGCCGATTT GGGCGATGGC GAACGTGGCG
CAAGGCGGAG CGTGTCTGGC CGTGTGGTTT AAAACTAAAG ACGCGAAGAT TAAAGCAATT
ACCTTACCGT CCGCATTTTC TGCCATGCTC GGTATCACCG AGGCGGCCAT TTTCGGGATT
AACCTTCGCT TTGTGAAGCC GTTTATTGCC GCGTTGATTG GTGGTGCAGC GGGCGGTGCG
TGGGTGGTTT CGGTACATGT TTATATGACG GCAGTGGGCC TGACGGCGAT TCCAGGCATG
GCGATTGTGC AGGCGAACTC GCTGCTGAAC TACATCATCG GCATGGTCAT CGCTTTTGGC
GTCGCCTTTG CGCTGTCTTT ACTGCTGAAA TACAAAACGG ACTCTGAATA A
 
Protein sequence
MDFDKIAHAL IPLLGGKENI ASAAHCATRL RLVLVDDALA DQQAIGKVDG VKGCFRNAGQ 
MQIIFGTGVV NKVYAAFIQA TGIGESTKSE AADIAARKLN PFQRIARLLS NIFVPIIPAI
VASGLLMGLL GMVKTYGWVN ADNAIYIMLD MCSSAAFIIL PILIGFTAAR EFGGNPYLGA
TLGGILTHPA LTNAWGVASG FHTMDFFGLD IAMIGYQGTV FPVLLAVWFM SHVEKQLRRV
IPDALDLILT PFFTVIISGF IALLIIGPAG RALGDGISFV LSTLIEHAGW LAGLLFGGLY
SVIVITGIHH SFHAIEAGLL GNPSIGVNFL LPIWAMANVA QGGACLAVWF KTKDAKIKAI
TLPSAFSAML GITEAAIFGI NLRFVKPFIA ALIGGAAGGA WVVSVHVYMT AVGLTAIPGM
AIVQANSLLN YIIGMVIAFG VAFALSLLLK YKTDSE