Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1945 |
Symbol | |
ID | 4599850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2075341 |
End bp | 2076612 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639776543 |
Product | putative deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_923142 |
Protein GI | 119716177 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACACC TGGAGCTGTA CGACGCCGCC GCGCGCGCGC GTCTGGTCGA GGAGCCGCCG AAGCGGGTCG ACGCCCCCGA GCGGACGCCG TTCGAGCGCG ACCGCGCCCG CCTCGTCCAC GCGGCGGCCT CGCGGCGGCT GGCCGCGAAG ACCCAGGTGG TCGGCCCGCA GAGCAACGAC TTCGTGCGCA ACCGGCTCAC CCACAGCCTC GAGGTCGCCC AGGTCGCGCG CGACCTCTCA CGGGCCCTCG GCAGCCAGCC GGACATCGCC GAGACCGCGG CGCTGGCCCA CGACCTCGGG CACCCGCCGT TCGGCCACAA CGGCGAGCGG GTGCTGGCCG AGCTCGGAGA GTCCTGCGGC GGCTTCGAGG GCAACGCCCA GACCCTGCGG CTGCTCACCC GGCTCGAGGC GAAGACCGTG GACGCCTCCG GTGCGTCGGT CGGCCTGAAC CTCACCCGGG CGACCCTGGA CGCCTGCACC AAGTACCCCT GGCCGCGGTC GGCGGCCGAG GAGCCGCAAG GGGTGCACGC CGACGGGTCG CCGCGGCTGG TGCGCAAGTT CGGCGTGTAC GACGACGACC GGCCGGTGTT CGACTGGATG CGCCGGGGCG CGGTCGGCAC CCACCAGTGC CTCGAGGCGC AGGTGATGGA CCTGGCCGAC GACGTCGCCT ACTCCGTCCA CGACATCGAG GACGGCATCG TCGCGGGCCG CGTCGACCTC ACCCGGATCG ACGAGGCCGC GGTCTGGGCG ACGGTGCGCG ACTGGTACCT CCCCGACGCG ACCGACGAGG TCCTCGGCGC GACCCTCGCC GGCCTGCGCG AGGTCGGCAG CTGGCCGGAG GCGCCGTACG ACGCCAGCCG CCGCTCGCTG GGCGCGCTCA AGAACCTCAC CAGCGACCTG ATCGGACGCT TCTGCGGGGC GGTGCAGCAC GCGACGTTCG CCGCGAGCGA CGGCCCGTTC GTGCGCTACG CCGCCGATCT GGTGGTCCCC GAGCGGACCC GCCTGGAGAT GGCGGTGCTG AAGGGCATCG CCGCCTACTA CGTGATGCAG GCCGACGACC GGGTCGCCGC GATGGTGCGC CAGCGCGAGC TGCTCGCCGA GCTGGTCGCC GTCCTCGCCC ACCGCGGCCC GGATGCCCTC GAGCGGGCGT TCGCCGACGA CTGGCGCGCC GCGGCCGACG ACGCGGCCCG CCTGCGGGTC GTCATCGACC AGGTCGCCTC GCTGACCGAT GCCAGCGCGC TCACCTGGCA CGAGTCGCTC CGCTCGCGCT GA
|
Protein sequence | MEHLELYDAA ARARLVEEPP KRVDAPERTP FERDRARLVH AAASRRLAAK TQVVGPQSND FVRNRLTHSL EVAQVARDLS RALGSQPDIA ETAALAHDLG HPPFGHNGER VLAELGESCG GFEGNAQTLR LLTRLEAKTV DASGASVGLN LTRATLDACT KYPWPRSAAE EPQGVHADGS PRLVRKFGVY DDDRPVFDWM RRGAVGTHQC LEAQVMDLAD DVAYSVHDIE DGIVAGRVDL TRIDEAAVWA TVRDWYLPDA TDEVLGATLA GLREVGSWPE APYDASRRSL GALKNLTSDL IGRFCGAVQH ATFAASDGPF VRYAADLVVP ERTRLEMAVL KGIAAYYVMQ ADDRVAAMVR QRELLAELVA VLAHRGPDAL ERAFADDWRA AADDAARLRV VIDQVASLTD ASALTWHESL RSR
|
| |