Gene Aave_1004 details

Gene Information

Locus tag: Aave_1004
Symbol:
ID: 4667973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax citrulli AAC00-1
Kingdom: Bacteria
Replicon accession: NC_008752
Strand:
Start bp: 1101848
End bp: 1103107
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 71%
IMG OID: 639822204
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_969376
Protein GI: 120609698
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
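
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 1101848
end_bp = 1103107
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # prints: 1260 0 419
```

Under these assumptions the computed values agree with the listed gene length (1260 bp) and protein length (419 aa).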


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGACA TGCATTCCCC CGGCGCAGTC GGGCAGTCCG GTGCCGGCGC GGTGGAAGGC 
GCCGCGGGCC AGACTGCCGG CCCGCTCGCG CCCTATGCCA GCGACCCCGC CCGGACGCGC
GGCCGCCGCC ATCCGCAGGC ACCCGCGCCC ACGCGCACCG AGTACCAGCG CGACCGCGAT
CGCATCGTGC ATTCGACCGG TTTCCGCCGG CTGGTGTACA AGACCCAGGT GTTCCTCAAC
CACGAGGGCG ACCTGTTCCG CACCCGGCTC ACGCATTCGC TCGAAGTCGC GCAACTGGGG
CGTTCCATCG CGCGCTCGCT GCGCATCAAC GAGGATCTGG TCGAGGCGAT TTCGCTGGCG
CACGACCTGG GGCATACGCC TTTCGGGCAT GCGGGCCAGG ATGCGCTGAA TGCCTGCATG
GCCCCGTACG GAGGCTTCGA GCACAACCTG CAGAGCCTGC GCGTGGTGGA TGCGCTGGAA
GAGCGCTACC CGGACTACGA CGGCCTGAAC CTGACCTTCG AGACGCGCGA GGGCATCCTC
AAGCACTGCT CGCGCGCCAA TGCCGAGCGC CTGGAGGCCT CGGAGCCCGG CGGCGTGGGC
GCGCGGTTCC TGCGCCGGGA GCAGCCCGGC CTGGAGGCGC AGCTCTGCAA CCTGGCCGAC
GAGATCGCCT ACAACGCCCA CGACATCGAC GATGGCGTGC GTTCCGGCCT GATCACCCTG
GCCCAGCTGC GCGACGTGCC GCTGTTCGAC CGTTTCCGGG CCGATGCCGA GCGGGAACAT
CCGCACCTGG CCGCGCCGCA GGCACGGCGC CGGCTGCTGC ACGAGGCCAT CCGCCGCATG
CTGAGCGCCC AGGTGTACGA CGTGATCGGC GCCACCGAAG CCGCGCTGGC GCAGGCTGCA
CCGCGCACGG CGGACGATGT GCGCCGCATG CCGCCGCTGG TGGCTTTCAG CGCGGCCATG
CGTGCGCAGT CCATCGAACT CAAGCGCTTC CTGTTCCAGA ACCTCTACCG CCATCCGCAG
GTGATGGAAA CCACCGGCCA CGCCCAGCAG GTGGTGCGCG ACCTGTTCGG CATCTACGCG
GAGCGGCCTT CGGAAATGAA GCACCGGTTC GCCGACCGCG CCGTGGCGGC GCGGGAGGCA
GCTGCCGGTG CCGGCGGCCT GCCCCATGCC CGCATCGTCG CCGACTTCAT CGCCGGCATG
ACGGACCGTT TCGCGGTGCG CGAGCACGAA CGGCTGACGG GCCGCCGGCT GCTCTCCTGA
 
Protein sequence
MADMHSPGAV GQSGAGAVEG AAGQTAGPLA PYASDPARTR GRRHPQAPAP TRTEYQRDRD 
RIVHSTGFRR LVYKTQVFLN HEGDLFRTRL THSLEVAQLG RSIARSLRIN EDLVEAISLA
HDLGHTPFGH AGQDALNACM APYGGFEHNL QSLRVVDALE ERYPDYDGLN LTFETREGIL
KHCSRANAER LEASEPGGVG ARFLRREQPG LEAQLCNLAD EIAYNAHDID DGVRSGLITL
AQLRDVPLFD RFRADAEREH PHLAAPQARR RLLHEAIRRM LSAQVYDVIG ATEAALAQAA
PRTADDVRRM PPLVAFSAAM RAQSIELKRF LFQNLYRHPQ VMETTGHAQQ VVRDLFGIYA
ERPSEMKHRF ADRAVAAREA AAGAGGLPHA RIVADFIAGM TDRFAVREHE RLTGRRLLS
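
The sequences above are printed in space-separated blocks of 10 residues, so they need to be cleaned before any calculation. The following is a minimal sketch of how the GC content figure in the record could be recomputed from the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as a short demonstration; pasting the full sequence in its place gives the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as a demonstration.
first_line = "ATGGCGGACA TGCATTCCCC CGGCGCAGTC GGGCAGTCCG GTGCCGGCGC GGTGGAAGGC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(f"{gc_fraction(seq):.0%}")   # GC content of this fragment only
```

The same `clean_sequence` helper also works on the protein sequence, for example to confirm its length against the Protein Length field.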