Gene Cthe_3148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3148 
Symbol 
ID4809711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3719026 
End bp3720924 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content44% 
IMG OID640108581 
ProductABC transporter related protein 
Protein accessionYP_001039536 
Protein GI125975626 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAATA ATGTCAAGAA AAACAAAAAA TCGATTGTGC TTCGATTGTT CAAAGATATA 
TTGGAGTTTT ACCCGGTAAT GTTACCGGTT GCTATTGTGT GTATATTATT CAATGCGACT
ATCAGCTCTA TTCCGGCTGT ATTTATGCAA AACGTGATTG CCATTGTGGA GAATAACTGG
CGGACCGGAG ACTGGAATGC GGTAGGGGGA AGGATATTAT CCCTTGTTGC CGTATTGGTG
GCATTTTACA CGCTGTCGAT CCTTAGCGGT ATTGCATACA ACCAGATGAT GGCGATTATT
ACCCAGGGGA CTTTGAAGAA ATTCCGCTGC AAAATGTTTG CCAGGATGCA GACTCTTCCA
ATCAAGTATG TGGATACTCA CAATCATGGT GACATTATGA GTGTTTACAC CAATGATATT
GATACGCTGC GCCAAATGAT CTCCCAAAGC TTTCCGCAAC TTTTATTGTC CGGTATTACG
GTTCTTACCG TATTTTCCAT CATGGTTTAT TTTTGTTTGT GGCTGACGAT TATTGTATTA
ATAGGTGTAA TCGCAATGTT TTTTGTTACA AGAAGGGTGG GCGGTGGTTC CGCCAAGTAT
TTCATCAGAC AGCAGAAAGC CCTTGGACGC GTGGAAGGCT TTATAGAAGA AATGATGAAC
GGACAAAAAG TAATAAAAGT ATTTTGTCGC GAAGAGGAAG TCAAGAAAGA TTTTGACAAA
CTCAATGAGG CTTTATTTGA TGATGCAAGG AAAGCAAACC GCTATGCCAA TATTTTAGCT
CCGATTTTAA ATAACATTGG TAATGTATTG TATGTTTTTG TTGCCATTAC CGGCGGTGTT
TTATTGGTTA CAAACGCGCC GAATGTAAGT CTTTCCGGAC TTTCCATGGG AATCAGCATT
GTCGTTCCTT TCCTTAACAT GACAAAACAG TTTGTTGGCA ATATAAACCA GGTGTCCCAG
CAGATAAATG CGGTTATTAT GGGGCTTGCC GGTGCGCAGC GGATTTATGA GCTGATTGAC
GAGGAACCGG AGCAGGATGA CGGATATGTT ACCTTGGTAA ATGTCCGCGA AGAAAACGGT
CAGTTGATTG AATGTGAAGA GAGAACGGGA ATCTGGGCCT GGAAACATCC TCACAGCAGT
GACGGCTCGG TAACGTATAC GAAGCTCATG GGGGATGTGA GATTGTTTAA CGTCGATTTC
GAATATGAAC CGGGAAAGTC TGTTTTGCAT GATATCAGCC TTTATGCGAA GCCGGGTCAA
AAAGTGGCGT TTGTCGGTGC CACCGGTGCC GGCAAGACCA CAATTACGAA TTTACTTAAT
CGCTTTTATG ATATTGCCGA CGGTAAAATA CGTTATGACG GTATTAATAT CAACAAAATC
AAAAAATCGG ATCTTCGCCG CGCCGTTGGT ATAGTACTTC AGGATACCAA CCTTTTTACC
GGTACCGTAA TGGACAATAT CCGCTACGGC AAACTGGATG CCACTGATGA AGAGTGTATT
GCGGCCGCAA AACTTGCAGG TGCCGATGAC TTTATACGCC GTTTGCCCGA CGGATATTAT
ACAATGCTTA CTGAAAACGG GGCAAATCTG TCCCAGGGAC AGAGACAGTT GATTTCCATT
GCGAGAGTGG CGGTTGCGGA TCCTCCGGTT ATGATTTTGG ATGAAGCGAC ATCTTCCATT
GATACAAGAA CCGAGGCAAT TGTTCAGCGG GGTATGGATG CTTTGATGGA GGGAAGGACT
GTGTTTGTAA TTGCCCATCG TCTGTCCACG GTTAAAAACG CCAATGTTAT CATTGTGTTG
GATCACGGAC GTATCATTGA ACGGGGCACT CACGAGGAAC TGATAGCCCA AAAGGGTCAG
TATTATCAAC TCTACACAGG TGCTTTTGAG CTGGAATAA
 
Protein sequence
MENNVKKNKK SIVLRLFKDI LEFYPVMLPV AIVCILFNAT ISSIPAVFMQ NVIAIVENNW 
RTGDWNAVGG RILSLVAVLV AFYTLSILSG IAYNQMMAII TQGTLKKFRC KMFARMQTLP
IKYVDTHNHG DIMSVYTNDI DTLRQMISQS FPQLLLSGIT VLTVFSIMVY FCLWLTIIVL
IGVIAMFFVT RRVGGGSAKY FIRQQKALGR VEGFIEEMMN GQKVIKVFCR EEEVKKDFDK
LNEALFDDAR KANRYANILA PILNNIGNVL YVFVAITGGV LLVTNAPNVS LSGLSMGISI
VVPFLNMTKQ FVGNINQVSQ QINAVIMGLA GAQRIYELID EEPEQDDGYV TLVNVREENG
QLIECEERTG IWAWKHPHSS DGSVTYTKLM GDVRLFNVDF EYEPGKSVLH DISLYAKPGQ
KVAFVGATGA GKTTITNLLN RFYDIADGKI RYDGININKI KKSDLRRAVG IVLQDTNLFT
GTVMDNIRYG KLDATDEECI AAAKLAGADD FIRRLPDGYY TMLTENGANL SQGQRQLISI
ARVAVADPPV MILDEATSSI DTRTEAIVQR GMDALMEGRT VFVIAHRLST VKNANVIIVL
DHGRIIERGT HEELIAQKGQ YYQLYTGAFE LE