Gene Avin_18960 details

Gene Information

Locus tag: Avin_18960
Symbol:
ID: 7760830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1886349
End bp: 1887755
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 67%
IMG OID: 643804794
Product: deoxyguanosinetriphosphate triphosphohydrolase-like protein
Protein accession: YP_002799083
Protein GI: 226944010
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
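
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1886349
END_BP = 1887755
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1407, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))    # 468, matches Protein Length
```

Under those assumptions, the reported gene length and protein length are consistent with the listed coordinates.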


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCTGGC AGTCGTGCGG CGATCCGGGC GCCGCCCTCC CGCAGCGGCT GCCCACCGAA 
CGACGAGGAA CGGCTTTGGA CTGGCAAACC CTACTGACCC GAGAACGCCT GGGCAAATCG
GTGCACAGCG TCGATGAACT GGGCCGCAGC CCTTTCCACA AGGACCACGA CCGCATCATC
TTTTCCGGCG CCTTCCGCCG CCTGGGCCGC AAGACCCAAG TGCATCCGGT GTCCAGCAAC
GACCATATCC ACACCCGCCT GACCCACTCC CTGGAAGTCA GTTGCGTCGG CCGCTCGCTG
GGCATGCGGG TCGGCGAGGT GCTGCGCGAC ACCCTGCCGG AATGGTGCGG CCCCGCCGAC
CTCGGCATGG TCGTCCAGTC GGCCTGCCTG GCTCACGACA TCGGCAACCC GCCGTTCGGC
CATTCCGGCG AGGATGCCAT CCGCCACTGG TTCCACCAGG CCGCCGGACG CGGCTGGCTG
GACGGCATGA GCGACGCGGA ACGCGACGAC TTCCTGCATT TCGAGGGCAA CGCCCAGGGC
TTTCGCGTAC TCACCCAACT GGAATACCAC CAGTTCGACG GCGGCACCCG CCTGACCTAC
GCCACCCTCG GCGCCTACCT CAAATACCCC TGGGCATCGC GCTACGCACA GGCTCCGGGC
TACAAGAAGC ACAAGTTCGG CTGCTACCAG AGCGAACTGC CGCTGCTCGA ACAGATCGCC
GAGAAGCTCG GCCTGCCCAG GCAGGGCGAG CAGCGCTGGG CGCGTCATCC GCTGGTCTAT
CTGATGGAGG CGGCGGACGA CATCTGCTAC GCGCTGATCG ACCTGGAAGA CGGCCTGGAA
ATGGAGCTTT TGGACTACTC CGAGGTCGAG GCCCTGCTGC TCGGCCTGGT CGGCGACGAC
CTGCCGGAAT CCTACCGCCA GCTCGGTCCG CGCGACTCGC GGCGGCGCAA ACTGGCGATC
CTGCGCGGCA AGGCCATCGA ACACCTGACC AACGCGGCGG CCCGCGCTTT CGTCGAGCAG
CAGAAGGCCC TGCTCGAGGG CAGCCTGGCC GGCGACCTGG TCGAACACAT GCACGGACCG
GCCAAGGATT GCGTGCTGCA GGCCAAGCAT GTCGCCCGCG AGAAGATCTT CCACGACAAG
CGCAAGACCC TCCACGAGAT CGGCGCCTAC ACCACCCTGG AGATCCTGCT CGACGCCTTC
TGCGGCGCGG CGCTGGAGCA GCACGGCGGC AGGCGGATAT CGTTCAAGAA CCGGCGCATC
CTCGACCTGC TCGGCAACAA CGCACCGGAC CCGCAGTGGC CGCTGTACCA CGCCTTCCTG
CGCACGATCG ACTTCATCGC CGGCATGACC GACGGTTACG CCACCGAGAT GGCCCGGCAA
ATGACCGGCC TCTCCGGTCC CGCGTAG
 
Protein sequence
MRWQSCGDPG AALPQRLPTE RRGTALDWQT LLTRERLGKS VHSVDELGRS PFHKDHDRII 
FSGAFRRLGR KTQVHPVSSN DHIHTRLTHS LEVSCVGRSL GMRVGEVLRD TLPEWCGPAD
LGMVVQSACL AHDIGNPPFG HSGEDAIRHW FHQAAGRGWL DGMSDAERDD FLHFEGNAQG
FRVLTQLEYH QFDGGTRLTY ATLGAYLKYP WASRYAQAPG YKKHKFGCYQ SELPLLEQIA
EKLGLPRQGE QRWARHPLVY LMEAADDICY ALIDLEDGLE MELLDYSEVE ALLLGLVGDD
LPESYRQLGP RDSRRRKLAI LRGKAIEHLT NAAARAFVEQ QKALLEGSLA GDLVEHMHGP
AKDCVLQAKH VAREKIFHDK RKTLHEIGAY TTLEILLDAF CGAALEQHGG RRISFKNRRI
LDLLGNNAPD PQWPLYHAFL RTIDFIAGMT DGYATEMARQ MTGLSGPA
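
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any comparison. The following is a minimal sketch of how the two could be checked against each other. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two differ in their permitted start codons), and the helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first three blocks and the last line of the gene sequence are used as input here.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGCGCTGGC AGTCGTGCGG CGATCCGGGC"
last_line = "ATGACCGGCC TCTCCGGTCC CGCGTAG"

print(translate(first_blocks))  # MRWQSCGDPG
print(translate(last_line))     # MTGLSGPA
```

These two fragments translate to the first ten and last eight residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.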