Gene Cthe_1603 details

Gene Information

Locus tag: Cthe_1603
Symbol:
ID: 4809594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1934107
End bp: 1934982
Gene Length: 876 bp
Protein Length: 291 aa
Translation table: 11
GC content: 41%
IMG OID: 640107021
Product: phosphate ABC transporter, inner membrane subunit PstA
Protein accession: YP_001038022
Protein GI: 125974112
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0581] ABC-type phosphate transport system, permease component
TIGRFAM ID: [TIGR00974] phosphate ABC transporter, permease protein PstA
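
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1934107
END_BP = 1934982


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 876
print(protein_length(length_bp))  # 291
```

Both results agree with the Gene Length (876 bp) and Protein Length (291 aa) reported in the record.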


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0162233
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGAATCCA TCAACAAACT ATCATATATT CAAAAACTCA GGACTTATAA ACGAGACCCC 
AGATCGCTAG TTTTATTCCT TTTAGTAATT GTATCAACAA TTCTTACGAT TGGGCTTTTG
CTTTTTTTGA TTGGGTATAT CATGATCAAG GGAATCCCCC ATATCAAGCC TGAACTGTTT
CAGTGGGAAT ACAACACCTT AAACGTGTCG CTGATGCCTG CTTTAATTAA TACTGTTATC
ATAACGATCA TTTCGCTTCT GATTGCGGCA CCTATAGGAG TTTTTTCTGC TATCTATCTG
GTTGAATATG CAAAAAAAGG CAACAAGTTG GTTAGCATAA TCAGAATTAC TGCAGAAACA
CTGTCAGGGA TTCCGTCAAT TGTTTACGGC CTGTTTGGTT TGCTGTTCTT TGTAACTGCT
CTTGGCTGGG GGATGTCTCT CCTGGCAGGG GCATTCACAC TGTCAATTAT GATACTGCCA
CTAATTATGC GTACCACCGA GGAAGCATTA AAAGCAGTTC CCAACTCGTA TCGAGAAGGC
AGTTATGGGC TGGGTGCAGG TAAGCTAAGG ACTGTATTTA AAATCGTTTT ACCCTCTGCA
ATGCCTGGCA TTTTGGCCGG CGTCATCCTG GGCATCGGAA GGATTGTAGG AGAAAGCGCC
GCGTTAATCT ATACGGCTGG AACTGTAGCT GAAATACCAA AGGGCAGCGA TTTTCTGTTT
GATTCTACCC GAACATTATC GGTTCATATG TATGCTCTTG CCAGTGAAGG GCTATATGTA
AACCAGTCTT ACGCAACTGC TGTTATACTA TTAATTATTG TTGTATTGAT TAACTACCTA
TCAGGGTTTA TATCAAAGAA ACTTTCAAAA GTCTAA
 
Protein sequence
MESINKLSYI QKLRTYKRDP RSLVLFLLVI VSTILTIGLL LFLIGYIMIK GIPHIKPELF 
QWEYNTLNVS LMPALINTVI ITIISLLIAA PIGVFSAIYL VEYAKKGNKL VSIIRITAET
LSGIPSIVYG LFGLLFFVTA LGWGMSLLAG AFTLSIMILP LIMRTTEEAL KAVPNSYREG
SYGLGAGKLR TVFKIVLPSA MPGILAGVIL GIGRIVGESA ALIYTAGTVA EIPKGSDFLF
DSTRTLSVHM YALASEGLYV NQSYATAVIL LIIVVLINYL SGFISKKLSK V
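
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only: it assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code; only the permitted start codons differ), uses just the first 60 bases of the gene sequence, and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_60 = "ATGGAATCCA TCAACAAACT ATCATATATT CAAAAACTCA GGACTTATAA ACGAGACCCC"
print(translate(first_60))  # MESINKLSYIQKLRTYKRDP
```

The output matches the first 20 residues of the protein sequence. Passing the full 876 bp gene sequence to `gc_percent` would give the value to compare with the GC content field.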