Gene Gdia_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0444 
Symbol 
ID6973838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp488616 
End bp489818 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content70% 
IMG OID643389976 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_002274855 
Protein GI209542626 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.662695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG CACCCTACGC TGTCCAGCCT GCCACCGCCC GGGGCCGGCT GCATGCCGAA 
CCCGAATCGC CCAGCCGCAC GCCGTGGCAG CGTGACCGCG ACCGTGTCCT GCATTCGGCG
GGCTTCCGCA CCCTGCAATA CAAGACCCAG GTCTTCGTCA ATCATCAGGG CGACTTCTTC
CGTACCCGCC TGACCCATTC GCTGGAGGTC GCGCAGATCG CGCGGTCCAT CGCCCGGAAC
CTGGGGGTGG ACGAAGACCT GACGGAAACG CTGGCCCTGG CCCACGATCT GGGCCACACG
CCCTTCGGCC ATGCGGGCGA GGACGCGCTG GCGGCGGCGA TGCGCGCCTG GGGCGGGTTC
GACCACAACA CCCAGACCCT GCGCCAGGTG ACGCAACTGG AGCGCCGCTA TTTCGGCTTC
GACGGGCTGA ACCTGACATG GGAAACGCTG GAAGGCCTGA TCAAGCATAA CGGGCCGGTG
GACCACCCGA CGGGCTATGT CGCGCGCTAT GCCGAACGGC TGGGTCTGGA CCTGACCACC
TTCGCGCCGG TCGAGGCCCA GGTCGCGGCG ATGGCCGACG ACATCGCCTA TCACGCCCAC
GACCTGGATG ACGGGCTGCG CGCGGGGCTG CTGTGCCTGT CGGACCTGGC GGGCCTGCCG
GTGGTGGGGG CGGCGCTGGC GCAGGTCCGG CAACTGGCCG GGGGGGCGGA CCTGCCGGCC
TCGGCGCCGC AGGCACCCGG CCTGCATGCC GCCGACCTGC ATGTGGACGA CCGGATGCGC
CACGAAACCA TCCGCCGGGT CATCAATGCG CTGGCGGTGG ACCTGACGGA GCAGACGCGG
CGCAACCTGG AACGGCTGGC CCCCCGTTCG GCCGACGACG TGCGCCGGGC GGAGGCACCG
GTCGTGGCCT ACAGCCCCGC CATGGCGCGC GATAACGGGG CCATCCGCAC CTTCCTCTAC
GCACGGCTGT ACCGGCACTG GCGGGTCAAC CGCATGACGC GCAAGGCGCG CATGGCGGTC
GAATCCATCT TCTCGATCCT GGCCGATGAC CTGTCGCTGC TGCCGGACGG CTGGCGGCAG
CAGGCGCGCG GGGCGGACCA GACCGGTGCG CGCCGGGTCG TGGCCGATTA TATAGCCGGA
ATGACGGACC GATTCGCGAT GGAAGAACAT CGACGGTTGA CGGATCTGTC CGTGCCGGGC
TGA
 
Protein sequence
MSIAPYAVQP ATARGRLHAE PESPSRTPWQ RDRDRVLHSA GFRTLQYKTQ VFVNHQGDFF 
RTRLTHSLEV AQIARSIARN LGVDEDLTET LALAHDLGHT PFGHAGEDAL AAAMRAWGGF
DHNTQTLRQV TQLERRYFGF DGLNLTWETL EGLIKHNGPV DHPTGYVARY AERLGLDLTT
FAPVEAQVAA MADDIAYHAH DLDDGLRAGL LCLSDLAGLP VVGAALAQVR QLAGGADLPA
SAPQAPGLHA ADLHVDDRMR HETIRRVINA LAVDLTEQTR RNLERLAPRS ADDVRRAEAP
VVAYSPAMAR DNGAIRTFLY ARLYRHWRVN RMTRKARMAV ESIFSILADD LSLLPDGWRQ
QARGADQTGA RRVVADYIAG MTDRFAMEEH RRLTDLSVPG