Gene Ent638_1455 details


Gene Information

Locus tag: Ent638_1455
Symbol:
ID: 5114420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1608672
End bp: 1609634
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 53%
IMG OID: 640491641
Product: alkanesulfonate transporter substrate-binding subunit
Protein accession: YP_001176186
Protein GI: 146311112
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
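
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1608672, 1609634)
print(length)                     # 963
print(protein_length_aa(length))  # 320
```

Both values agree with the Gene Length and Protein Length fields reported for this locus.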


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0687578
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAAAT ACCTCTTTCG TCTTGGGCTA ACGAGTTTGC TGGCCGTCTC CGCTCTGGCT 
CATGCGGCAA ACTCTGCGCC AGAAAGTTTA CGTATCGGCT ACCAGAAAGG CAGCATCAGC
ATGGTGCTGG CAAAGAGTCA TCAGCTGCTG GAAAAACGTT ATCCGCAGAC CCAGTTTTCG
TGGATTGAAT TCCCGGCTGG CCCGCAAATG CTCGAAGCAC TCAATGTGGG CAGCATTGAT
ATAGGCAGTA CGGGCGATAT ACCGCCGATA TTCGCCCAAG CCGCGGGTGC AGATTTGGTC
TACGTCGGCG TCGAACCCGC TAAGCCTAAA GCCGAAGTCA TTCTGGTGCC TGAAAACAGT
CCGATTAAGA CCGTCGCGGA TCTTAAAGGC CATAAAGTTG CGTTCCAGAA AGGTTCCAGT
TCGCACAACC TGCTGCTCCG TGCACTGCAG CAGGCAGGCC TGACGTTCAA GGATATTCAG
CCGATCTACT TAAGCCCGGC CGATGCGCGC GCTGCTTTCC AGCAAAATAA CGTGGATGCC
TGGGCCATTT GGGATCCTTA CTATTCTGCA GCTCTGCTGC AGGGTGGCGT TCGCGTTCTG
AAAGATGGCG AAACGCTTAA ACAGACCGGT TCGTTCTATC TTGCGGCGCG ACCTTATGCC
GAAAAGAACG GTGAATTTGT TCAAGGTGTG CTGAATACCT TCAGTGAAGC GGATGCGCTC
ACCCAAAGCC AGCGTCAGGA GAGTATCACC CTGCTGGCAA AAACCATGGG CCTGCCCGAA
CCGGTGATCG CCAGCTATCT GGATCATCGG CCAACCACCG TGATCAAACC AGTTGATGCC
CACACGGCGG TATTACAGCA ACAAACCGCG GATCTGTTTT ATGAAAACCG TCTGCTCCCG
AAAAAGATCG ATATTCGCGA CCGTATCTGG CAACCCGCTG GCAAAGAAGG ATCGAAATCA
TGA
 
Protein sequence
MFKYLFRLGL TSLLAVSALA HAANSAPESL RIGYQKGSIS MVLAKSHQLL EKRYPQTQFS 
WIEFPAGPQM LEALNVGSID IGSTGDIPPI FAQAAGADLV YVGVEPAKPK AEVILVPENS
PIKTVADLKG HKVAFQKGSS SHNLLLRALQ QAGLTFKDIQ PIYLSPADAR AAFQQNNVDA
WAIWDPYYSA ALLQGGVRVL KDGETLKQTG SFYLAARPYA EKNGEFVQGV LNTFSEADAL
TQSQRQESIT LLAKTMGLPE PVIASYLDHR PTTVIKPVDA HTAVLQQQTA DLFYENRLLP
KKIDIRDRIW QPAGKEGSKS
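
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the method used by the database. It builds the codon table by hand instead of using a sequence library, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which codons may act as start codons. Because this gene starts with ATG, the standard assignments are assumed to be sufficient here.

```python
from itertools import product

# Codon -> amino acid assignments shared by the standard code and table 11.
# Codons are enumerated with bases in the order T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 60 bases and last 15 bases of the gene sequence listed above.
first_line = "ATGTTTAAAT ACCTCTTTCG TCTTGGGCTA ACGAGTTTGC TGGCCGTCTC CGCTCTGGCT"
last_codons = "GGATCGAAATCATGA"

print(translate(first_line))   # MFKYLFRLGLTSLLAVSALA
print(translate(last_codons))  # GSKS
```

Passing the full 963-base gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison with the 320-residue protein sequence and the reported GC content of 53%.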