Gene Athe_0850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0850 
Symbol 
ID7407425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp943805 
End bp944743 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content37% 
IMG OID643715228 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002572738 
Protein GI222528856 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4209] ABC-type polysaccharide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.3931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATGCAG TATACTACAA TGGTTCAAAG AAAACGTTCT GGCAAAAAGT AAAAGAGCAG 
AAAGAACTTG TTTTTATGAT ATTTCCATTT GTCTTGTATG TAATTTTGTT TCACTACATA
CCACTTTGGT GGTGGGTCAT TGCATTTAAA GAATACAGGC CATTCCAAGG TGTTTGGGGT
TCAGAGTGGG TAGGTTTGCA GCAATTTAAA GATTTATTCA GCGACTCTGG TTTTTGGCTT
GCTATGAGAA ATACAATTGT TATAAGCTTT TTGAAGCTTG TTACGTCCTT TGCAGCAGCT
ATCTTACTTG CATTGATGCT TAACGAAGTG AAGAATATGT TATTTAAAAG AACTATTCAG
ACAATTTCAT ATCTTCCACA CTTTGTTTCT TGGGTTGTGG CGGCGAGTAT AGTTATAAGT
GTGCTTTCAC CTGAGTCAGG TATACTCAAT CAAATTTTGA TGTCACTCAA GATTATTAAA
CAGCCAATTG TCTGGATGGG TGAAGGACAT TATTTCTGGT GGATATTGGC TCTCTCGAAT
GTTTGGAAGG AAACAGGATG GAATGCTATA GTGTATTTGG CAGCAATGAC AAGTATAGAC
CCTGAACTTT ACGATGCTGC AAGCGTGGAT GGTTGTGGAA GACTGCAGAA GATAAGATAT
GTAACACTGC CAGGGATTGC TCCAACAATT AGTATGCTTC TTATTCTCAA CGTTGGTTGG
CTTTTGAATG CTGGTTTTGA ACAGGTTCTT TTGCTCAGAA ACCCCCTTGT TCAGGATTAC
TCTCAAATTC TTGACACCTA TGTTCTTGAT TATGGTATTA CAATGTACAG ATATTCATAT
GCTACAGCTG CTGGTATGTT TAAGAGCGTT GTAAGTATTT TGCTTGTGTT GTTTGCAAAT
AAGGTTGCTG CAAAATTGAA TGCATCAACT GTTGTATAA
 
Protein sequence
MDAVYYNGSK KTFWQKVKEQ KELVFMIFPF VLYVILFHYI PLWWWVIAFK EYRPFQGVWG 
SEWVGLQQFK DLFSDSGFWL AMRNTIVISF LKLVTSFAAA ILLALMLNEV KNMLFKRTIQ
TISYLPHFVS WVVAASIVIS VLSPESGILN QILMSLKIIK QPIVWMGEGH YFWWILALSN
VWKETGWNAI VYLAAMTSID PELYDAASVD GCGRLQKIRY VTLPGIAPTI SMLLILNVGW
LLNAGFEQVL LLRNPLVQDY SQILDTYVLD YGITMYRYSY ATAAGMFKSV VSILLVLFAN
KVAAKLNAST VV