Gene Cthe_2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2267 
Symbol 
ID4810005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2695033 
End bp2696802 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content48% 
IMG OID640107673 
ProductV-type ATP synthase subunit A 
Protein accessionYP_001038662 
Protein GI125974752 
COG category[C] Energy production and conversion 
COG ID[COG1155] Archaeal/vacuolar-type H+-ATPase subunit A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCAGG GAACAATAGT TAAAGTCTCC GGGCCTTTGG TAATTGCCGA AGGCATGAGA 
GATGCAAACA TGTTTGACGT TGTACGTGTA AGTGAACATC GTTTAATTGG CGAAATAATC
GAAATGCATG GAGATCGAGC CTCCATCCAG GTATACGAAG AAACCGCAGG CTTGGGCCCC
GGCGAACCGG TGGTTTCCAC CGGAGCGCCT CTAAGTGTTG AGCTGGGACC TGGGCTTATT
GAAAATATTT TTGACGGTAT TCAAAGACCT CTTGTAAAAA TGAGAGAAAT GGTTGGCAGC
AACATAACAA GAGGTATTGA CGTTACTGCC CTTGACAGAA GCAAAAAGTG GGATTTTCAA
CCTACCGTAA AAAAAGGTGA CAAAGTAACC GCCGGCGATG TAATAGGAAA AGTCCAGGAA
ACTTCCATTG TGGAGCACAG AATAATGGTG CCCTATGGAG TACAGGGAAC AATTGAGGAG
ATAAAGAGCG GAAGCTTTAC TGTGGAGGAA ACCGTCGCAA AGGTTCGGAC AGAAAACAAC
GAACTGGTTG ATATCTGCAT GATGCAGAAA TGGCCGGTAC GTATCGGCCG TCCATATAGA
GAAAAGCTCC CCCCCAACGC TCCACTTGTT ACAGGTCAAA GGGTTATAGA CACTCTATTC
CCTTTGGCCA AAGGTGGAGT TGCGGCCGTA CCCGGACCTT TCGGAAGCGG TAAAACCGTG
GTTCAGCACC AGCTTGCAAA ATGGGCCGAC GCTGATATAG TTGTCTATAT AGGCTGCGGA
GAGCGCGGCA ACGAAATGAC CGACGTTTTA AAAGAATTCC CGGAGCTTAA AGACCCAAAA
ACCGGCGAAT CTCTTATGAA GAGAACCGTT CTTATAGCAA ATACGTCAGA CATGCCTGTT
GCGGCCAGAG AGGCATCCAT TTATACAGGC ATGACTATTG CGGAATATTT CAGGGATATG
GGCTATAGTG TGGCGTTAAT GGCAGACTCC ACTTCCCGCT GGGCGGAAGC ATTAAGAGAA
ATGTCCGGAC GTCTCGAAGA AATGCCCGGT GAAGAAGGTT ATCCGGCATA TCTTGGCTCA
AGGCTTGCCC AGTTCTATGA AAGAGCGGGA AGAGTTGTAT GCCTTGGTTC CGACGGAAGA
GAAGGTGCCC TTACCGCCAT CGGTGCCGTG TCACCTCCGG GCGGTGACCT TTCCGAACCT
GTTACACAGG CAACACTGAG AATTATCAAA GTGTTCTGGG GGCTTGACTC AAGTCTTGCC
TACAGACGAC ATTTCCCTGC AATCAACTGG CTGCAGAGCT ATTCGCTGTA CCTTGACATA
ATAGGAAAAT GGATTAGTGA AAACATTTCA AGGGATTGGG AGACATTAAG ATCCGACACT
ATGCGCATTC TGCAGGAGGA AGCGGAACTT GAGGAAATTG TGCGCTTAGT CGGTGTTGAC
GCCCTGTCGC CGTCCGACAG GCTTACTCTG GAAGCCGCCA AGTCGATACG CGAAGACTAT
CTCCACCAGA ATGCCTTCCA TGAAGTGGAT ACCTATACTT CATTAAACAA GCAGTACAGA
ATGTTAAAAC TCATACTGGG ATTCTATTAC AGCGGCAAAA AAGCCCTGGA AGCAGGAGTA
AGCATCAAAG AGCTGTTTGA ACTTCCTGTC AGGGAAAAAA TCGGAAGAGC GAAATATACG
CCCGAGGATC AGGTAAACAG CCACTTCAAC GAAATTGAAA AAGAACTTAA TGAGCAAATA
GAAGCCCTCA TCGCAAAGGA GGTGCAATAA
 
Protein sequence
MSQGTIVKVS GPLVIAEGMR DANMFDVVRV SEHRLIGEII EMHGDRASIQ VYEETAGLGP 
GEPVVSTGAP LSVELGPGLI ENIFDGIQRP LVKMREMVGS NITRGIDVTA LDRSKKWDFQ
PTVKKGDKVT AGDVIGKVQE TSIVEHRIMV PYGVQGTIEE IKSGSFTVEE TVAKVRTENN
ELVDICMMQK WPVRIGRPYR EKLPPNAPLV TGQRVIDTLF PLAKGGVAAV PGPFGSGKTV
VQHQLAKWAD ADIVVYIGCG ERGNEMTDVL KEFPELKDPK TGESLMKRTV LIANTSDMPV
AAREASIYTG MTIAEYFRDM GYSVALMADS TSRWAEALRE MSGRLEEMPG EEGYPAYLGS
RLAQFYERAG RVVCLGSDGR EGALTAIGAV SPPGGDLSEP VTQATLRIIK VFWGLDSSLA
YRRHFPAINW LQSYSLYLDI IGKWISENIS RDWETLRSDT MRILQEEAEL EEIVRLVGVD
ALSPSDRLTL EAAKSIREDY LHQNAFHEVD TYTSLNKQYR MLKLILGFYY SGKKALEAGV
SIKELFELPV REKIGRAKYT PEDQVNSHFN EIEKELNEQI EALIAKEVQ