Gene Cthe_2934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2934 
Symbol 
ID4810217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3449604 
End bp3450503 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content43% 
IMG OID640108357 
ProductABC transporter related protein 
Protein accessionYP_001039325 
Protein GI125975415 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1122] ABC-type cobalt transport system, ATPase component 
TIGRFAM ID[TIGR01166] cobalt transport protein ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.786781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATTC ACCTTAATCC CTGTTTTGTT AGGAGAGTAA AAATGAAGGA TATAATAGTA 
AAAGCCAATG ATGTGGAATA CGCGTATAAA AAAAACAGCG ACGAAACGCC CAAAGTTTTG
GTGCTTAAGG AACTCAACAC TGAAATATGC GAAGGGGAAT TTGTGGCTGT TATCGGGCGC
AACGGGTCCG GAAAATCGAC TTTTGCAAGA CTTCTCAATG CTATTTTGAT TCCCACCCGG
GGCGTACTCT ACATAGGTGG AAAAGAAACC TACACTGAAG CGAATTTGTG GGAAATAAGG
CGAACGGTGG GCATGGTGTT TCAAAATCCG GACAATCAAA TAATTGCAAC CTCGGTTGAA
GAGGATGTGG CTTTTGGCCC GGAGAACATT GGCATACCTT CGGACGAGAT TGTAAAAAGG
GTGGAAGAGG CCCTAAGAAG TGTGGGGCTT GAGGAGTATA AAAAAGCATT GCCCCACCAT
CTGTCAGGCG GTCAAAAACA AAGGGTGGCC ATAGCAGGCA TTTTAGCCAT GAAGCCAAAA
TGCATTGTTC TTGATGAGGC AACTTCAATG CTTGACCCGT CAGGAAGAAA AGAAGTGCTG
AAAGTCCTTA GAGATTTGAA TGAAAAGGAA AACATAACAA TCATCCATAT TACCCATTAT
ATGGAAGAGG CAATACTTGC AAAAAGGATT TTGGTGATGG ATGAAGGTAA GATCGTTATG
GACGGAAACC CCCGCCAAAT TTTTTCAAAA GTGGAAGAAA TTAAGGCTTT AGGGCTTGAT
GTGCCGCAGG TGGCAGAACT ATTCCATGAA CTGAAAAAAG ACGGCTATAA TGTGCCGGAT
AATATACTTA CCGTGGAAGA AGCGGTCCAA TGTCTAGCAG AGATGATTGC AAAAGCATGA
 
Protein sequence
MGIHLNPCFV RRVKMKDIIV KANDVEYAYK KNSDETPKVL VLKELNTEIC EGEFVAVIGR 
NGSGKSTFAR LLNAILIPTR GVLYIGGKET YTEANLWEIR RTVGMVFQNP DNQIIATSVE
EDVAFGPENI GIPSDEIVKR VEEALRSVGL EEYKKALPHH LSGGQKQRVA IAGILAMKPK
CIVLDEATSM LDPSGRKEVL KVLRDLNEKE NITIIHITHY MEEAILAKRI LVMDEGKIVM
DGNPRQIFSK VEEIKALGLD VPQVAELFHE LKKDGYNVPD NILTVEEAVQ CLAEMIAKA