Gene Dgeo_2047 details


Gene Information

Locus tag: Dgeo_2047
Symbol:
ID: 4058393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 2154183
End bp: 2155931
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 64%
IMG OID: 641231086
Product: V-type ATP synthase subunit A
Protein accession: YP_605510
Protein GI: 94986146
COG category: [C] Energy production and conversion
COG ID: [COG1155] Archaeal/vacuolar-type H+-ATPase subunit A
TIGRFAM ID:
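
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2154183
end_bp = 2155931

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1749
print(remainder)       # 0
print(protein_length)  # 582
```

Under these assumptions the computed values agree with the listed Gene Length (1749 bp) and Protein Length (582 aa).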


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAC AGCAAAGAGG CGTCGTGCAA AGCATCGCTG GTCCCGCCGT GATTGCAAGC 
GGGATGTATG GTGCGAAGAT GTACGACATC GTGCGCGTGG GCAAGGAGCG CCTGGTCGGT
GAGATCATCC GGCTGGAGGG CAACACCGCC TTCGTGCAGG TGTACGAAGA TACCAGCGGC
CTGACGGTGG GTGAACCCGT TGAGACGACC GGTTTGCCGC TCAGCGTGGA GCTGGGGCCG
GGGATGCTCA ACGGCATCTA CGACGGCATC CAGCGTCCGC TCGACAAGAT CCGCGAGGCT
TCCGGCGACT TCATCGCGCG CGGCATCGAG GTGTCGAGCC TCGACCGCAC CAAGAAGTGG
GCCTTTACGC CGACCGTGCA GGCGGGTGAC ACCGTAGGCG GCAGCTCGAT TCTGGGGACC
GTGCCGGAAT TCAGCTTCAC CCACAAGATT CTGACACCGC CGGACAAGGG GGGGCGGCTG
ACCTGGGTGG CCCCCGCAGG CGAATACACC ATCGATGACA CCATCGCCAC CTTGGAAGAC
GGCACGAACC TGCGCCTGGC CCACTACTGG CCGGTGCGGG CGCCGCGTCC GGTCGCGCAG
AAGCTGGATC CCAGCCAGCC CTTCCTGACG GGCATGCGCA TCCTCGACGT GCTGTTCCCG
CTGGTGATGG GCGGTACAGC GGCGATTCCC GGTCCCTTCG GTTCGGGCAA AACGGTGACG
CAGCAGTCGG TTGCGAAGTA CGGCAACGCC GATATCGTGG TGTACGTGGG CTGTGGCGAG
CGCGGCAACG AGATGACCGA TGTGCTCGTG GAGTTCCCGG AACTGGAAGA CCCCAAGACC
GGCGGGCCCC TGATGCACCG CACCATCCTG ATCGCCAACA CCTCCAACAT GCCGGTGGCA
GCGCGTGAAG CCTCGGTCTA TACCGGCATC ACATTGGCCG AGTACTTCCG CGACCAGGGC
TACAGCGTTT CGCTGATGGC CGACTCCACC AGCCGCTGGG CCGAGGCGCT GCGCGAGATC
TCCTCCCGTC TGGAAGAGAT GCCCGCCGAA GAGGGCTATC CGCCCTACCT GGGCGCCAAG
CTGGCGGCCT TCTACGAGCG CGCGGGGGCC GTGAAGACCC TGGCCGGGGA AGACGGCGCG
GTTTCCGTGA TCGGGGCGGT GTCCCCGGCG GGCGGCGACA TGTCTGAACC CGTCACCCAG
GCGACGCTGC GCATCACCGG TGCCTTCTGG CGTCTGGATG CGGGTCTGGC CCGGCGCCGT
CACTTCCCGG CGATCAACTG GAACGGTTCC TACAGCCTGT TTACGCCGAT TCTCGATTCC
TGGTACCGCG AGAACGTGGG CCGGGACTTC CCCGAACTGC GCCAGCGCAT CAGCAACCTG
CTTCAGCAGG AAGCGTCCCT CCAGGAAGTT GTGCAGCTCG TCGGCCCCGA TGCGCTGCAG
GATCAGGAAC GCCTGGTGAT CGAGACGGGC CGTATGCTGC GGCAGGACTT CCTCCAGCAG
AACGGCTTTG ACCCGGTGGA TGCCTCGGCG TCTATGCCCA AGAACTACGG CCTGATGAAG
ATGATGCTGA AGTTCTACGA CGAGGCGGAG GCCGCGCTGC GAAATGGCGT TGGCATCGAT
GAAATCATTC AGAACCCGGT GATCGAGAAA CTCTCGCGCG CTCGCTACGT GCCTGAGGCC
GACTTCATGG CCTACGCCGA GAGCGTGATG GACGAACTCG ACACCACCTT CAAAGGAGTG
AAGGCGTGA
 
Protein sequence
MTQQQRGVVQ SIAGPAVIAS GMYGAKMYDI VRVGKERLVG EIIRLEGNTA FVQVYEDTSG 
LTVGEPVETT GLPLSVELGP GMLNGIYDGI QRPLDKIREA SGDFIARGIE VSSLDRTKKW
AFTPTVQAGD TVGGSSILGT VPEFSFTHKI LTPPDKGGRL TWVAPAGEYT IDDTIATLED
GTNLRLAHYW PVRAPRPVAQ KLDPSQPFLT GMRILDVLFP LVMGGTAAIP GPFGSGKTVT
QQSVAKYGNA DIVVYVGCGE RGNEMTDVLV EFPELEDPKT GGPLMHRTIL IANTSNMPVA
AREASVYTGI TLAEYFRDQG YSVSLMADST SRWAEALREI SSRLEEMPAE EGYPPYLGAK
LAAFYERAGA VKTLAGEDGA VSVIGAVSPA GGDMSEPVTQ ATLRITGAFW RLDAGLARRR
HFPAINWNGS YSLFTPILDS WYRENVGRDF PELRQRISNL LQQEASLQEV VQLVGPDALQ
DQERLVIETG RMLRQDFLQQ NGFDPVDASA SMPKNYGLMK MMLKFYDEAE AALRNGVGID
EIIQNPVIEK LSRARYVPEA DFMAYAESVM DELDTTFKGV KA
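
The sequences above are grouped in blocks of ten residues, so they need to be cleaned before any computation. The sketch below is one possible way to strip the grouping, compute GC content, and translate with the codon assignments of translation table 11. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sketch ignores the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG.

```python
# Codon assignments of NCBI translation table 11, listed in the usual
# order with bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First and last rows of the gene sequence shown above.
first_row = "ATGACGCAAC AGCAAAGAGG CGTCGTGCAA AGCATCGCTG GTCCCGCCGT GATTGCAAGC"
last_row = "AAGGCGTGA"

print(translate(first_row))  # MTQQQRGVVQSIAGPAVIAS
print(translate(last_row))   # KA*
```

The translated first row matches the first 20 residues of the protein sequence, and the last row gives the final two residues followed by the stop codon (`*`). Passing the full gene sequence to `gc_percent` would give a value to compare with the listed GC content.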