Gene Strop_0187 details


Gene Information

Locus tag: Strop_0187
Symbol:
ID: 5056623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 212467
End bp: 213462
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 73%
IMG OID: 640472457
Product: NUDIX hydrolase
Protein accession: YP_001157050
Protein GI: 145592753
COG category: [L] Replication, recombination and repair
COG ID: [COG2816] NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding
TIGRFAM ID:
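
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption; the record does not state its coordinate convention) and if the stop codon is counted in Gene Length but not in Protein Length. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(212467, 213462)
print(length, protein_length(length))  # prints "996 331"
```

Both values match the Gene Length (996 bp) and Protein Length (331 aa) fields.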


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.805777
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCCTC CCCCGATCCT CGCGTGTCGG TGTTGGCGTG CGACGGTGGG GTCCATGCGG 
GAGGACGACG TGGCTCTGGC GTACGGCGGT GGTTGGGTGG ACCGGGCCGG TGCCCTCCGT
GCCGATCCGG ATCGGCTGGC GACGCTGCTG GGCGCTGTAG ACACCAGAGT GCTGCCGCTG
TGGCAGGACC GCTGCCTGGT CAACGGGACG GCGCCGGTTC GGCTGAGCGG CGAAGCGGCG
GCGAGGGCGC ACGTCGCCGC GCGGGAAACG GTATTCCTCG GGTTCGCGGC GGGGCGGGCA
GTCTTCGCGG TGGACCTCTC CGAGCTCGCT GAGGATGCCG CTCTGGCGGC TGTCGGGGCG
ACTCGGGTGG TGGATGTGCG CGGGCTCGTC GGGCCGCTCA GCCCGGCCGA GGCGGCTATC
CAGGCGTACG CGCGCGGTCT GCTGCACTGG CACCGGCAGC AGCGGTACTG CGGCACCTGC
GGGGGGTCGA CCAGCGTCCA GGACGCCGGG CACGCCCGGC GGTGCGCGGA TCCCACCTGT
GCCCGACTGT ACTTCCCCCG GATCGAGCCC GCCATCATTG TGCTGGTGGA GACAGCGGGC
TCGCCCGGGC GTTGCCTGTT GGCCCGGCAC GCCGGGGCGG CCGAGGGTGC GTTCTCGACC
CTCGCCGGCT TCGTCGAGGT GGGGGAGACG CTGGAAGACG CGGTCCGGCG GGAGGTAGCC
GAAGAAGCGG GGGTGGTGGT GACCGACGTG GCGTACCAGG GGTCGCAGGC CTGGCCGTTC
CCGGCGGGGC TGATGGTGGG CTTTCGGGCC ACCGCCGTAT CCGACGAGAT CCGGGTGGAC
GGGGTCGAGC TGCTGGAGGC ACGCTGGTTC ACCCGCGCCG AGCTGCGCCA ACGGGCTGCG
GTAGGCCACC CACTCGGTCG GCTGGACTCG ATCGGTCACC ACCTGCTCAG CAGCTGGCTG
GCCGAGGACG AAGCAGCCGC GTCAGCTCCC CGCTGA
 
Protein sequence
MSPPPILACR CWRATVGSMR EDDVALAYGG GWVDRAGALR ADPDRLATLL GAVDTRVLPL 
WQDRCLVNGT APVRLSGEAA ARAHVAARET VFLGFAAGRA VFAVDLSELA EDAALAAVGA
TRVVDVRGLV GPLSPAEAAI QAYARGLLHW HRQQRYCGTC GGSTSVQDAG HARRCADPTC
ARLYFPRIEP AIIVLVETAG SPGRCLLARH AGAAEGAFST LAGFVEVGET LEDAVRREVA
EEAGVVVTDV AYQGSQAWPF PAGLMVGFRA TAVSDEIRVD GVELLEARWF TRAELRQRAA
VGHPLGRLDS IGHHLLSSWL AEDEAAASAP R
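
The sequences above are printed in 10-character blocks, so the whitespace has to be removed before they can be analysed. The gene begins with GTG while the protein begins with M: under translation table 11, GTG is one of the permitted start codons and is read as methionine at the first position. The sketch below illustrates both points in Python. It builds the codon table by hand rather than relying on a library, and the function names are hypothetical; codons containing anything other than A, C, G or T will raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 assigns the same amino acids as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for n, pos in enumerate(range(0, len(s) - 2, 3)):
        codon = s[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon is read as methionine at position 0.
        protein.append("M" if n == 0 and codon in START_CODONS else aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate_cds("GTGTCGCCTC CCCCGATCCT"))  # prints "MSPPPI"
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.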