Gene Apar_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0697 
Symbol 
ID8413558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp776129 
End bp777187 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content43% 
IMG OID645022275 
Productdeoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_003179717 
Protein GI257784500 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000013673 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000510491 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTATCCA TTCAACGAGA AGATCTGGAA GCTCGTGAGC ATCAAATCTT ATCTCCTGAA 
GCGGCTTTTT CTGATCAAAG TAAAGGCCGT GCGGTGGCAG AGGAGCCTGA CCAGTATCGT
ACGTGTTATC AGTGCGATAG AGATCGTATC CTTCATAGTA AGAGCTTCAG AAGACTTGCT
CACAAGACAC AGGTTTTTCT CGCCCCAGAG GGGGATCACT ATAGAACTCG TCTTATCCAT
ACGCTTGAGG TTTCACAGAT TGCACGCTCA ATTGCACGAC CTCTTGGCTT AAACGAAGAT
CTAACTGAGG CAATTGCGCT TGGACATGAT TTGGGCCATA CGCCGTTTGG ACATATAGGT
GAAAAGGCGC TTTCGTTTGC TATTAGTCTG TACAGAGGAA TGGATCCTGA TGCTCCAGAA
AATGAGTATA TTTTTGCTCA TAACCAGCAA AGTGCTCGCA TTGTTGAGTA TTTAGAGAAA
GACGGACAGG GTCTTAATCT TTCGTATGAG GTTGTTGATG GTATTAGATG TCACTCAGGT
AACCTACGTG CAGAAACGGC AGAAGGAAGA ATTGTTGCTA TCTCAGATCG TATTGCCTAT
GTAACACACG ATATTGATGA CGCAAAACGC GCCGGCCTGC TTTCGGAGGA GTATCTTCCA
ACTGAGGCTC GCGAGGTGTT GGGCAATAGT TCGCCTGAGC GTATTGAACA TATGGTTCAT
GATATTGTCT CTGAGAGTTC TCGAGTAGGG GACATTAAGA TGACTGACTC TATGTGGAGT
GCCATGATGA CCATGAGAGC TTTTCTGTTT GCTAATCTTT ACGCATCAGG TGACGCAAAA
TATGAAGAAC CTAAAGCGTA TGATCTCATC ATTGAGTTAT TTGATTACTT TGTAAATCAT
ATGGATGAGG TTCCTGCAGA GTATAAGTGT CATGATTGTG ATCACCCAGA GATTCAAGTT
GCAGATTATG TTTCAGGTAT GACTGATAGA TATGCAACGA GAGTGTTTGA AGATCTTCGT
CTACCTCGTT CCTGGGGTAA AAGAAGATAT GTAAAGTAA
 
Protein sequence
MLSIQREDLE AREHQILSPE AAFSDQSKGR AVAEEPDQYR TCYQCDRDRI LHSKSFRRLA 
HKTQVFLAPE GDHYRTRLIH TLEVSQIARS IARPLGLNED LTEAIALGHD LGHTPFGHIG
EKALSFAISL YRGMDPDAPE NEYIFAHNQQ SARIVEYLEK DGQGLNLSYE VVDGIRCHSG
NLRAETAEGR IVAISDRIAY VTHDIDDAKR AGLLSEEYLP TEAREVLGNS SPERIEHMVH
DIVSESSRVG DIKMTDSMWS AMMTMRAFLF ANLYASGDAK YEEPKAYDLI IELFDYFVNH
MDEVPAEYKC HDCDHPEIQV ADYVSGMTDR YATRVFEDLR LPRSWGKRRY VK