Gene Cthe_2706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2706 
Symbol 
ID4810700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3192796 
End bp3193905 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content43% 
IMG OID640108125 
ProductABC transporter related protein 
Protein accessionYP_001039098 
Protein GI125975188 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000123322 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAAA TACAAAATTT GACCAAAAGA TATGGTCAAA TAGTAGCCGT TGACAACCTG 
AATTTCACGG TCGAAAAGGG AGAAATTGTA GGCTTCCTTG GACCGAACGG AGCGGGCAAG
TCAACCACCA TGAACATAAT AACCGGCTAT CTGCCTTCGA CCGAGGGGAC TGTAAAAGTA
GCCGGGTACG ACATAGCCGA GCAGCCGAAT GAAGTAAAAA GACGTATCGG TTATTTGCCG
GAAAATCCGC CGCTTTATAC AGATATGACC GTCGATGAGT ACTTAAACTT TGTCAGTGAA
CTTAAAAAAG TGGAAAAAAG CAAGAGAAAG CAGCAAATAT CTGACATTAT GGAGATGTTG
AAAATTACCG ATGTCAGAAA AAGGCTTATA AGAAACCTTT CAAAGGGTTA CAAGCAGAGG
GTGGGATTTG CCCAGGCTTT GATAGGAAAT CCCGAAGTTC TTATTTTGGA CGAGCCTACG
GTCGGTCTTG ATCCCAACCA GATACTTGAG GTAAGGAACG TTATAAAAGA ATTGAGAAAA
GATCACACAA TTATTTTCAG TACTCATATA ATGCAGGAAG TAAGCGCAGT ATGTGAACGC
ATTGTCATAA TAAACAAGGG TAGAATTGTG GCGGTGGACA CACCGGAAAA TCTGGCAAGA
GCTATTTCCA AGAGTTTGAG GTTAACGTTT AAGATTGCAG GAGAAAAGTC TTCTGTCATA
GCAGCCCTTC AGGCTGTTGA CGGTGTCAGA AATGTTGAAG TGCAGGATAA AGTTGAAGAT
GATGTATATG TATATATAGT TGATGCGGAC AAGGGAGTGG ATGTAAGAAA GCCTATATTC
TTTGCGATGG CAGATCTGGG CTATCCGATT CTTGAGACCA AGGAAGATGA AATGGGTCTT
GAGGAAATAT TCCGCGAGCT TACAACAAAG GATGTTGCTG TGAACGATGA GTCTGAAGGT
TCCGGGGAGG AAGTTCCCGA AAGTTCACAG GAAGGAGCCT CGGAAGAAGT ATCCACAGAG
AATGCTGAGG CAGAAGAGAA TGAACAGGAG CAAGAAAACG GGCAGGAACA GGACAGTGCT
CAGACCGAGG AAAAGGAGGT TGAGGAATAA
 
Protein sequence
MIEIQNLTKR YGQIVAVDNL NFTVEKGEIV GFLGPNGAGK STTMNIITGY LPSTEGTVKV 
AGYDIAEQPN EVKRRIGYLP ENPPLYTDMT VDEYLNFVSE LKKVEKSKRK QQISDIMEML
KITDVRKRLI RNLSKGYKQR VGFAQALIGN PEVLILDEPT VGLDPNQILE VRNVIKELRK
DHTIIFSTHI MQEVSAVCER IVIINKGRIV AVDTPENLAR AISKSLRLTF KIAGEKSSVI
AALQAVDGVR NVEVQDKVED DVYVYIVDAD KGVDVRKPIF FAMADLGYPI LETKEDEMGL
EEIFRELTTK DVAVNDESEG SGEEVPESSQ EGASEEVSTE NAEAEENEQE QENGQEQDSA
QTEEKEVEE