Gene Strop_3428 details

Gene Information

Locus tag: Strop_3428
Symbol:
ID: 5059897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3933508
End bp: 3934773
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 73%
IMG OID: 640475677
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001160237
Protein GI: 145595940
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
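
The coordinates and lengths in this table can be cross-checked against each other. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 3933508
END_BP = 3934773
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (1266 bp) and protein length (421 aa) agree with the listed coordinates.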


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.123721
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.137486
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCCCGG TCGGCCCGGA CGCGCGCCGG TGGGTGGACG AGCCCGCGAA GGACACCGGG 
CAGGGCCGGT CGGCATTCGA GCGAGACCGC GCCCGGGTGC TGCACTCGGC GGCCTTCCGG
CGACTCGCCG CCAAGACTCA GGTACACACG GCCGGCACCG ACGACTTCCT CCGAACCCGG
TTGACCCACT CGTTGGAGGT CGCCCAGATC GCCCGCGAGA TGGGCAGCCG ACTCGGCTGC
GACCCCGACG TGGTGGACAC CGCCGGGCTC GCCCACGACC TCGGGCACCC GCCGTTCGGG
CACAGCGGCG AGGAGGCACT GGACGCCCTC GCCACGACCT GCGGCGGCTT CGAGGGCAAC
GCGCAGACGC TGCGCGTACT CACCCGCCTG GAGGCGAAGG TGATCGGCCC GAACGGCGCC
TCCGCCGGGC TGAATCTCAC CCGGGCTTCC CTCGACGCGG TCAGCAAGTA TCCCTGGCCG
CGCCGGACGG GGGAGCGCAA GTTCGGTGTG TACGCCGATG ATCACCCGGT CTTCGAGTGG
CTGCGCGCGG ACGTGCCGGA CGGGCGGCGG TGTCTGGAGG CGCAGGTGAT GGACTGGGCC
GACGATGTCG CGTACTCGGT GCACGACGTC GAGGACGGCA TTCACGGCGG GTACGTGACG
CTGCGTCCGC TACTGGCCGA CGCCGACGAG CGGGCGGCGC TGAGTGCCGA TGTCGCGACG
ACCTACTCCG GCGAGTCCTC CTCCGACCTC GCTGAGGTGC TGGCTGACCT ACTCGCCGAC
CCACTGCTCG CGTCGCTATC GGGCTACGAC GGCAGTTACC GGGCGCAGAC CGCGCTGAAG
GCGACCACCA GCGCACTGAC CGGTCGCTTC GTCGCCGCCG CTGTGGCCGC CACCCGGCGC
CGGTTCGGGC CCGGCCCGCA CCGGCGGTAC GCCGCCGACC TGGTCGTGCC ACGCGAGGTC
CGGGCCCGGT GCGCGCTGCT CAAGGGCATC GCCCTGCGGT ACGTGCTGCG TCGCCCCGGC
TCCGCCGCCC GTTACGAGCG GCAACAACAG CTCCTCGCCG GACTGGTCGC GGGCCTGGCC
GACCGGGCGC CCGAGGCGTT GGACGCGGTG TTCGCCCCGC TGTGGCGGGC TGCCGGTGAC
GACGCGGCCC GGCTGCGGGT GGTGGTGGAT CAGGTGGCAT CGTTGACGGA TCCGGCGGCG
GTGCAGCGGC ACGCCCGGCT CTTTGGTGGC CCGACCGACG CCGGCGGGTG GTGCGACTTA
GGTTAA
 
Protein sequence
MSPVGPDARR WVDEPAKDTG QGRSAFERDR ARVLHSAAFR RLAAKTQVHT AGTDDFLRTR 
LTHSLEVAQI AREMGSRLGC DPDVVDTAGL AHDLGHPPFG HSGEEALDAL ATTCGGFEGN
AQTLRVLTRL EAKVIGPNGA SAGLNLTRAS LDAVSKYPWP RRTGERKFGV YADDHPVFEW
LRADVPDGRR CLEAQVMDWA DDVAYSVHDV EDGIHGGYVT LRPLLADADE RAALSADVAT
TYSGESSSDL AEVLADLLAD PLLASLSGYD GSYRAQTALK ATTSALTGRF VAAAVAATRR
RFGPGPHRRY AADLVVPREV RARCALLKGI ALRYVLRRPG SAARYERQQQ LLAGLVAGLA
DRAPEALDAV FAPLWRAAGD DAARLRVVVD QVASLTDPAA VQRHARLFGG PTDAGGWCDL
G
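
The gene and protein sequences can be checked against each other by translating the DNA. The sketch below is illustrative only: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. The two example inputs are the first line of the gene sequence and its final 12 bases.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAGCCCGG TCGGCCCGGA CGCGCGCCGG TGGGTGGACG AGCCCGCGAA GGACACCGGG"
print(translate(first_line))      # MSPVGPDARRWVDEPAKDTG
print(translate("GACTTAGGTTAA"))  # DLG*
```

Passing the full gene sequence to `translate` and `gc_content` would allow the whole protein sequence and the reported GC content to be compared in the same way.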