Gene Ent638_2951 details


Gene Information

Locus tag: Ent638_2951
Symbol:
ID: 5111984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 3202627
End bp: 3203640
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 52%
IMG OID: 640493145
Product: thiosulfate transporter subunit
Protein accession: YP_001177666
Protein GI: 146312592
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4150] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
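
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3202627
end_bp = 3203640

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the script prints 1014 0 337, which agrees with the Gene Length and Protein Length fields.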


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.339938
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCATTA CCGTACTGAA AAAAAGCACT TTGGCCATGG CGGGCTTATT GCTGATGGGG 
CAGGCGCAGG CCACTGAGTT GCTCAATAGC TCCTATGATG TCTCGCGCGA GCTTTTTGCG
GCCCTGAACC CACCGTTTGA ACAGCAGTGG GCGAAAGAGA ATAACGGCGA CAAGCTGACC
ATCAAACAAT CTCACGCCGG TTCTTCAAAA CAGGCGCTGG CGATTTTGCA GGGTCTGAAA
GCCGATGTGG TGACGTACAA CCAGATTACC GACGTGCAGA TCCTACATGA CAAAGGTAAC
CTGATCCCAG CGAACTGGCA GAGCCGTTTG CCGAATAACA GCTCGCCGTT CTACTCCACC
ATGGGCTTCC TGGTGCGTAA GGGTAACCCG AAAAATATTC ATAGCTGGAA TGATCTGGTG
CGTCCTGATG TGAAGCTGAT TTTCCCGAAT CCAAAAACGT CCGGTAACGC GCGCTATACC
TATCTGGCAG CATGGGGTGC AGCGGACAAA GCGGACGGTG GTGACAAAGC CAAAACCGAA
CAGTTTATGA CGCAGTTCCT GAAAAACGTC GAAGTGTTTG ATACCGGTGG TCGCGGGGCG
ACAACCACAT TCGCAGAGCG CGGTCTGGGC GATGTGCTGA TCAGTTTCGA ATCGGAAGTG
AACAACATCC GCAAACAGTA TGAAGCGCAG GGTTTCGAGG TGGTGATTCC TGAGACCAAT
ATTCTGGCGG AGTTCCCGGT TGCCTGGGTT GATAAAAATG TCAAAGCCAA CGGGACCGAA
AAGGCTGCGA AGGATTACCT GAATTTCCTT TACAGCCCGC AGGCGCAAAC CATCATCACC
GATTACTACT ATCGCGTGAA CAATCCGGAC GTGATGAACA AACTGAAAGA TAAATTCCCG
CAGACAGAGC TGTTCCGCGT GGAAGACAAG TTTGGCTCGT GGCCGGAAGT GATGAAAACG
CATTTTGTCA CCGGCGGTGA GTTAGACAAA CTGCTGGCGG CGGGGCGTAA GTAA
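
A GC content figure like the one reported in this record could be computed from the gene sequence roughly as follows. This is a minimal sketch: the function name gc_fraction is hypothetical, and only the first 10-base block of the sequence is used as input here. The full sequence can be pasted in as-is, since whitespace is stripped.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence above (3 of 10 bases are G or C).
first_block = "ATGGTCATTA"
print(gc_fraction(first_block))
```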
 
Protein sequence
MVITVLKKST LAMAGLLLMG QAQATELLNS SYDVSRELFA ALNPPFEQQW AKENNGDKLT 
IKQSHAGSSK QALAILQGLK ADVVTYNQIT DVQILHDKGN LIPANWQSRL PNNSSPFYST
MGFLVRKGNP KNIHSWNDLV RPDVKLIFPN PKTSGNARYT YLAAWGAADK ADGGDKAKTE
QFMTQFLKNV EVFDTGGRGA TTTFAERGLG DVLISFESEV NNIRKQYEAQ GFEVVIPETN
ILAEFPVAWV DKNVKANGTE KAAKDYLNFL YSPQAQTIIT DYYYRVNNPD VMNKLKDKFP
QTELFRVEDK FGSWPEVMKT HFVTGGELDK LLAAGRK
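
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons are permitted, so the sketch below builds the standard codon table. The function name translate is hypothetical, and the sketch assumes the input begins in frame and contains only A, C, G and T.

```python
from itertools import product

# Standard codon table, built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGTCATTA CCGTACTGAA AAAAAGCACT"))
```

For this 30-base input the script prints MVITVLKKST, the first ten residues of the protein sequence listed above.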