Gene Cthe_2522 details

Gene Information

Locus tag: Cthe_2522
Symbol:
ID: 4809278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2990668
End bp: 2992290
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 38%
IMG OID: 640107938
Product: membrane associated protein
Protein accession: YP_001038917
Protein GI: 125975007
COG category:
COG ID:
TIGRFAM ID:
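
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2990668, 2992290)
print(length, protein_length_aa(length))  # 1623 540
```

Under these assumptions the Start bp, End bp, Gene Length and Protein Length values agree with each other.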


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0879988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGCAA TGGTAGTTGA TATGAATGAC AAATACGCTG TTGTTGTTAA TAAAGAGGGT 
CAATACATTA AGATTAAGAG GAAAGCAGAG CATAGATTGG GCTATCAAGT TGAATTGCCG
GACAGAGTGA TTGGATTTGA AAGAAGAACG TTATTGAAAG TAGTATCTGT GGCAGCAGCT
CTGTTGATTG TTTCAAGTAT CTCCTTTGCC GTATACAGCT ATAATTTGCC TTACAGCTAC
GTGAATGTTG ACATAAATCC CAGTTTGGAG ATAATCCTTA ATATGTACAA CCGGATTATT
GATGTAAAAG CGTTAAATTC TGAAGGCGAG ATGCTGATTG AGGATTCTTA TAAGAATTCC
CGGTTGGATG AAGGTGTGGA AAAAATTATT GACAGTGCCG TGGCACAAGG TTTCCTTAAA
AATGATGAAG AAAATACCAT CATGCTTACC GTTGCAGGCA AGAATTCCAG AAAAGTTCTT
GAAATAAAGG AAGAAGTGGA GAGTACAGCA AACAAGGTTT TAAATGATGA TAACGTGGTT
TCCGAGGTGA TTGTTGAGAA CATAGTGCTG GAAAGACGCG AAGAAGCCAG AGAACTTGGT
ATAGCTCCGG GCAAATTGCT TTTGATTGAA AAGCTTAAAG AAGTCGATCC CAAGGCAACT
ACCGAAGAGT ACAAGGACAA ACCTGTGAAT GAGATTGTAA AAACCATCAG GGACATAAAG
AAAGTTCCAA ATGAGAACAA CCGAAAGGAT GACGATAAAA AGGTAAACAA TGAGCCGAAT
AAGCCATTAC CTGACAGAAA AGCCGATGTG GAAACAAGTG CCGGGGTAAA AGAAAATACC
GCCGGTCCGG ATGCAGGCAT CAAACCGGTG AATAAAACCG ATAATGCTAA ACCCAATGTT
GGTACCGACA TAAATAACAA AGAGAATAAA ACAGTCAGCA ATGCGAAGAT TGACAGCGGC
ATTGACAAGG GCAACAAAGA CAGTAAACCC AACAGTAATA CTAAAATTAA TAACGACGTC
AAAAAGGACA ACAAAGATAA TAAAACCAAC AGTGATGCCA AAACCTTCAA CGATGTCAGC
AAAGACAACA AAAATGATAA AGCTGACGGC AATGCTAAAA TCAACAATAA CATCAACAGA
GACAATAAAA TTACTCCGAT TAATCCGGAT AATAAATTTA GCAGCGGCGG CAGCAAAGAC
GACAAAGATA ACAAGCATGT TGATAGCAAA GATAAAATGA ATAATGAAGA CAACAAAAAC
ATTAACAATG GCAGCTGCCC CCAATACAAT CCATATTGGA ACCCTTACTG GAATCCCTAT
TGGAATCCAT ATTGGGGAAA TCCGAAAGAA AAAGAGGATA TGACAAAGCA AAATGATGAA
TGGTTTAAAA AGATGCAGGA AGAACAAAAG AAACAGTACG ATGAATGGCT GAAAAAGATG
CAGGAGGAGC AAAAAAAGCA GCATGATGAG TGGGTTAAAA AGATGGAAGA AATGAAAAAT
ACGGAAAAGA TGAAAAATCC ATACCAGGAA AATAAAATTG AAAAACCCAA AGAGGCAGAA
AAGGAGAATA AACCGGACAG ACCTCCGGAG CCGGGAAAAG AAATTTTGAA GAAAAGATGC
TAA
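
The gene sequence is printed in blocks of 10 bases, so whitespace has to be stripped before any analysis. A minimal sketch of that clean-up plus a GC-fraction calculation follows; the helper names are hypothetical, and the input is only the first three blocks of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGAGGGCAA TGGTAGTTGA TATGAATGAC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 30 0.4
```

Applied to the full sequence, the result can be compared with the GC content field reported in the record.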
 
Protein sequence
MRAMVVDMND KYAVVVNKEG QYIKIKRKAE HRLGYQVELP DRVIGFERRT LLKVVSVAAA 
LLIVSSISFA VYSYNLPYSY VNVDINPSLE IILNMYNRII DVKALNSEGE MLIEDSYKNS
RLDEGVEKII DSAVAQGFLK NDEENTIMLT VAGKNSRKVL EIKEEVESTA NKVLNDDNVV
SEVIVENIVL ERREEARELG IAPGKLLLIE KLKEVDPKAT TEEYKDKPVN EIVKTIRDIK
KVPNENNRKD DDKKVNNEPN KPLPDRKADV ETSAGVKENT AGPDAGIKPV NKTDNAKPNV
GTDINNKENK TVSNAKIDSG IDKGNKDSKP NSNTKINNDV KKDNKDNKTN SDAKTFNDVS
KDNKNDKADG NAKINNNINR DNKITPINPD NKFSSGGSKD DKDNKHVDSK DKMNNEDNKN
INNGSCPQYN PYWNPYWNPY WNPYWGNPKE KEDMTKQNDE WFKKMQEEQK KQYDEWLKKM
QEEQKKQHDE WVKKMEEMKN TEKMKNPYQE NKIEKPKEAE KENKPDRPPE PGKEILKKRC
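
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a hedged sketch: table 11 uses the same amino acid assignments as the standard genetic code (it differs in the permitted start codons), so a standard codon table is enough to reproduce the residues. The `translate` helper below is illustrative only, and the input is the first three blocks of the gene sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGGGCAA TGGTAGTTGA TATGAATGAC"))  # MRAMVVDMND
```

The output matches the first block of the protein sequence listed in the record.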