Gene Noca_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1945 
Symbol 
ID4599850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2075341 
End bp2076612 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content75% 
IMG OID639776543 
Productputative deoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_923142 
Protein GI119716177 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACACC TGGAGCTGTA CGACGCCGCC GCGCGCGCGC GTCTGGTCGA GGAGCCGCCG 
AAGCGGGTCG ACGCCCCCGA GCGGACGCCG TTCGAGCGCG ACCGCGCCCG CCTCGTCCAC
GCGGCGGCCT CGCGGCGGCT GGCCGCGAAG ACCCAGGTGG TCGGCCCGCA GAGCAACGAC
TTCGTGCGCA ACCGGCTCAC CCACAGCCTC GAGGTCGCCC AGGTCGCGCG CGACCTCTCA
CGGGCCCTCG GCAGCCAGCC GGACATCGCC GAGACCGCGG CGCTGGCCCA CGACCTCGGG
CACCCGCCGT TCGGCCACAA CGGCGAGCGG GTGCTGGCCG AGCTCGGAGA GTCCTGCGGC
GGCTTCGAGG GCAACGCCCA GACCCTGCGG CTGCTCACCC GGCTCGAGGC GAAGACCGTG
GACGCCTCCG GTGCGTCGGT CGGCCTGAAC CTCACCCGGG CGACCCTGGA CGCCTGCACC
AAGTACCCCT GGCCGCGGTC GGCGGCCGAG GAGCCGCAAG GGGTGCACGC CGACGGGTCG
CCGCGGCTGG TGCGCAAGTT CGGCGTGTAC GACGACGACC GGCCGGTGTT CGACTGGATG
CGCCGGGGCG CGGTCGGCAC CCACCAGTGC CTCGAGGCGC AGGTGATGGA CCTGGCCGAC
GACGTCGCCT ACTCCGTCCA CGACATCGAG GACGGCATCG TCGCGGGCCG CGTCGACCTC
ACCCGGATCG ACGAGGCCGC GGTCTGGGCG ACGGTGCGCG ACTGGTACCT CCCCGACGCG
ACCGACGAGG TCCTCGGCGC GACCCTCGCC GGCCTGCGCG AGGTCGGCAG CTGGCCGGAG
GCGCCGTACG ACGCCAGCCG CCGCTCGCTG GGCGCGCTCA AGAACCTCAC CAGCGACCTG
ATCGGACGCT TCTGCGGGGC GGTGCAGCAC GCGACGTTCG CCGCGAGCGA CGGCCCGTTC
GTGCGCTACG CCGCCGATCT GGTGGTCCCC GAGCGGACCC GCCTGGAGAT GGCGGTGCTG
AAGGGCATCG CCGCCTACTA CGTGATGCAG GCCGACGACC GGGTCGCCGC GATGGTGCGC
CAGCGCGAGC TGCTCGCCGA GCTGGTCGCC GTCCTCGCCC ACCGCGGCCC GGATGCCCTC
GAGCGGGCGT TCGCCGACGA CTGGCGCGCC GCGGCCGACG ACGCGGCCCG CCTGCGGGTC
GTCATCGACC AGGTCGCCTC GCTGACCGAT GCCAGCGCGC TCACCTGGCA CGAGTCGCTC
CGCTCGCGCT GA
 
Protein sequence
MEHLELYDAA ARARLVEEPP KRVDAPERTP FERDRARLVH AAASRRLAAK TQVVGPQSND 
FVRNRLTHSL EVAQVARDLS RALGSQPDIA ETAALAHDLG HPPFGHNGER VLAELGESCG
GFEGNAQTLR LLTRLEAKTV DASGASVGLN LTRATLDACT KYPWPRSAAE EPQGVHADGS
PRLVRKFGVY DDDRPVFDWM RRGAVGTHQC LEAQVMDLAD DVAYSVHDIE DGIVAGRVDL
TRIDEAAVWA TVRDWYLPDA TDEVLGATLA GLREVGSWPE APYDASRRSL GALKNLTSDL
IGRFCGAVQH ATFAASDGPF VRYAADLVVP ERTRLEMAVL KGIAAYYVMQ ADDRVAAMVR
QRELLAELVA VLAHRGPDAL ERAFADDWRA AADDAARLRV VIDQVASLTD ASALTWHESL
RSR