Gene Ent638_3292 details

Gene Information

Locus tag: Ent638_3292
Symbol:
ID: 5112121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 3595798
End bp: 3597084
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 53%
IMG OID: 640493499
Product: extracellular solute-binding protein
Protein accession: YP_001178007
Protein GI: 146312933
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
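
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
START_BP = 3595798
END_BP = 3597084


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both results agree with the Gene Length and Protein Length fields listed in the record.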


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.309737
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG TGCTTTTAAG CGCAGCAATC TCCGCTACCC TGGGCCTTAC CGCGCTGCCA 
TCCATGGCGC AAGATGTTGA TTTACGTATG TCCTGGTGGG GCGGCAATGG CCGCCATCAG
GTGACGCTGA AAGCGTTAGA AGAGTTCCAT AAACAGAACC CTGACATCAA CGTCAAAGCA
GAATACACCG GTTGGGACGG TCACTTGTCT CGTCTGACCA CGCAAATCGC GGGCGGCACT
GAGCCAGACG TGATGCAGAC CAACTGGAAC TGGCTGCCAA TTTTCTCGAA AAATGGCGAC
GGTTTCTACG ATCTGAACAA AATGAAAGAC GTGATCGACT TATCTCAGTT TGATCCGAAA
GAGCTGCAGT CCACCACGGT TAACGGCAAG CTAAACGGGA TCCCAATCTC CGTAACGGCG
CGTGTGTTCT ACTTCAACGA TGAAGTGTGG AAAAAAGCGG GCGTCGAGTA CCCGAAAACC
TGGGATGAGC TGAAAGCTGC CGGTAAGGCC TTCGAAAGCA AGCTGGGCAA ACAGTACTAT
CCGGTGGTGC TGGAGCACCA GGATACGCTG GCGCTGCTGA ACTCCTACAT GATTCAGAAG
TACAACGTCC CTGCGGTTGA CGAGAAAGCG AAAAAACTCG CCTGGAGCAA AGAGCAGTGG
GTTGAGGTCT TCCAGACCTA TAAATCCCTG GTTGATAGCC ACGTGATGCC GGACACCAAG
TACTACGCGT CGTTTGGTAA GAGCAACATG TACGAGATGA AGCCGTGGAT CGAGGGTGAA
TGGGGCGGTA CCTACATGTG GAACTCCACC ATCAAAAAAT ATTCCGATAA CCTGAAGCCA
CCAGCAAAAC TGGAGCTGGG TAACTACCCA ATGCTGCCAG GTGCAACCGA TGCGGGCCTG
TTCTTCAAAC CAGCACAGAT GCTCTCTATC GGTAAAACCA CCAAAAACCC AGAAGCCGCT
GCAAAAGTGA TTAACTTCCT GCTGAACAGC AAAGAAGGCG TGCAGACTCT GGGCCTGGAG
CGCGGCGTAC CATTGAGCAA AGTCGCGGTT CAGTACCTGA CCGAAGATGG CACCATCAAA
GAGAGCGATC CGTCTGTTGC GGGTCTGCGC ATGGCGCAGT CTCTGCCAGC CAAACTCTCC
GTGTCACCAT ACTTTGACGA TCCACAGATC GTGGCGCAGT TTGGTACCTC TCTGCAGTAC
ATCGACTACG GCCAGAAAAC CGTGGAAGAG ACCGCGACAG ACTTCCAACG TCAGGCTGAA
CGTATCCTGA AACGCGCAAT GCGCTAA
 
Protein sequence
MKKVLLSAAI SATLGLTALP SMAQDVDLRM SWWGGNGRHQ VTLKALEEFH KQNPDINVKA 
EYTGWDGHLS RLTTQIAGGT EPDVMQTNWN WLPIFSKNGD GFYDLNKMKD VIDLSQFDPK
ELQSTTVNGK LNGIPISVTA RVFYFNDEVW KKAGVEYPKT WDELKAAGKA FESKLGKQYY
PVVLEHQDTL ALLNSYMIQK YNVPAVDEKA KKLAWSKEQW VEVFQTYKSL VDSHVMPDTK
YYASFGKSNM YEMKPWIEGE WGGTYMWNST IKKYSDNLKP PAKLELGNYP MLPGATDAGL
FFKPAQMLSI GKTTKNPEAA AKVINFLLNS KEGVQTLGLE RGVPLSKVAV QYLTEDGTIK
ESDPSVAGLR MAQSLPAKLS VSPYFDDPQI VAQFGTSLQY IDYGQKTVEE TATDFQRQAE
RILKRAMR
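
The gene and protein sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to join those blocks and translate the nucleotide sequence so the two listings can be compared. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1; alternative start codons, which table 11 permits, are not handled. The helper names `clean_sequence`, `gc_fraction` and `translate` are illustrative only.

```python
# Standard codon assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in a non-empty sequence that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence listed above.
fragment = clean_sequence("ATGAAAAAAG TGCTTTTAAG CGCAGCAATC")
print(translate(fragment))  # MKKVLLSAAI
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein listing, and `gc_fraction` can be compared against the GC content field.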