Gene Strop_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1289 
Symbol 
ID5057742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1448267 
End bp1449598 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content72% 
IMG OID640473561 
ProductAllergen V5/Tpx-1 family protein 
Protein accessionYP_001158137 
Protein GI145593840 
COG category[S] Function unknown 
COG ID[COG2340] Uncharacterized protein with SCP/PR1 domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.323784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGGCT GGAACGACCC GAGGAACCCA GACGGTAGCC GTCGGCAACC CGAACCGGCG 
GCCGATGAAC CAGCGTGGCC GACAGACCGC CCGGAGCCAC GCTCCGCCTA CCTGTTCGGT
GACGAGCCGG ATGGACCTCC TCACCGATGG GAGCCCACCG ACCGACGGGA GCAGCCCACC
GACGAGTGGG ACCGACAGCA GCCGACCGCG CACTGGCGGT CGGACGCCGA GCCCGCGCGA
GGCTGGGGAG CCGCGGAGCA CCCGTACCAC GGTCCCGTCG GTGACCCCTA CCACGAACAG
CCCACGCAGG GCTGGGAAGC CACCCCGACC TGGCAGCACG GAGAGCCCAC CCAGAACTGG
CAGCGGGAGC AGCCGGGCGA CCGGCCCGAC CAGCGGTTCG CGGGCGGACG GAACGAGCCC
ACCGGTAGCT GGCACGGCGC GGCCACCCCC ACCGGCGGGC CCGAGCCCAC CGGCCAGTGG
CAGGCCCCGG AGCCGACCGG CTGGTACGCC GACGAACCAA CGACCAGCAT GCCGTCCCTC
GCGGAAGCCG CAGCCGCCGG TGACGTGCCC GCCGGCGAGG ATCGGCCCCG TCGTCGGCAC
CGGCGACCGC TGCTCATCGG CGGAGCCGCG GCGGCAGCCA CACTGGTGGT GAGCCTCGGG
GTCGGCGCCG TTACCTTCGC CGGTGGCGGT GACGCCAGCC CCACCTCGGC CATCGACGAC
ATCGTGGCGA CGAACCCGAC CGAGGAGAGT GCGTTTCCCA GCGGCACGCC GACCTCGGCC
AGCCCCAGCG CCACGCCGAC CACGACGTCG CCATCACCGT CACCGTCGGT CACGCCGAGT
CGCAAGCCGA GCCCGACGGC CTCGCGCTCC ACCGCCGCCC CCCGGCCCAC CCCGAACCGC
ACCACCGCGC CCCCCACCAA CAGCACCACC GCGCCCGCCA ACGGCAACGT CAGCGAAGAC
GCGGCCGAGG TGGTTCGGCT GGCCAACATT GAGCGCAAGG AGGCCGGCTG CGCGGCGTTG
AGCATCGACG ACAAGCTGAT GACCGCAGCC CAGCGGCACA GCCAGGACCA GGCCGACAAC
CGGAAGATGT CACACGACGG CAGCAACGGC AGTAGCCCCG GAGACCGCAT CGACGATGTC
GGCTACCAGT GGCGCACCTA CGGGGAGAAC GTCGCCTGGA ACCAGCAGTC GCCCGCCGCG
GTGATGAAGG CGTGGATGAA CAGCTCCGGC CACCGGGCGA ACATCCTGAA CTGCTCCTTT
ACCGAAATCG GGATCGGCAT CGCGACCAGC AACGGACCCT ACTGGACGCA GGTCTTCGCC
GCGCCCCGTT GA
 
Protein sequence
MYGWNDPRNP DGSRRQPEPA ADEPAWPTDR PEPRSAYLFG DEPDGPPHRW EPTDRREQPT 
DEWDRQQPTA HWRSDAEPAR GWGAAEHPYH GPVGDPYHEQ PTQGWEATPT WQHGEPTQNW
QREQPGDRPD QRFAGGRNEP TGSWHGAATP TGGPEPTGQW QAPEPTGWYA DEPTTSMPSL
AEAAAAGDVP AGEDRPRRRH RRPLLIGGAA AAATLVVSLG VGAVTFAGGG DASPTSAIDD
IVATNPTEES AFPSGTPTSA SPSATPTTTS PSPSPSVTPS RKPSPTASRS TAAPRPTPNR
TTAPPTNSTT APANGNVSED AAEVVRLANI ERKEAGCAAL SIDDKLMTAA QRHSQDQADN
RKMSHDGSNG SSPGDRIDDV GYQWRTYGEN VAWNQQSPAA VMKAWMNSSG HRANILNCSF
TEIGIGIATS NGPYWTQVFA APR