Gene Strop_4242 details

Gene Information

Locus tag: Strop_4242
Symbol:
ID: 5060727
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4805754
End bp: 4807088
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 71%
IMG OID: 640476504
Product: Allergen V5/Tpx-1 family protein
Protein accession: YP_001161048
Protein GI: 145596751
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
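
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(4805754, 4807088)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```

Under these assumptions the computed values agree with the listed Gene Length (1335 bp) and Protein Length (444 aa).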


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.152087
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCGG ATGATAATCG CCGGCGACCC GAACCGGCGG CCGACGGCTG GGACCGGCCT 
GCCGACGAAT GGGACCGGCG GCAACAGACC GCGCACCGGC GGGCGGAGTC CGAGACCCCA
CAAGGCTGGG CAGCCGCGGG TAGCTCGTAC CATGACCAAC CCTCGCGGGG TGGGCAAGCC
GCCCCCACCT GGCAGCACAC CGACGGCTGG CAGCAGGAAT CAGCGGGCGG CTGGCGCCGG
CAGCCCACGG GCTGGCGAGA CGAGCCCTCC GGAGGCGGGC GTCACGCGGC TGGTGCCACC
GGAGGACCCG AGGTCACCGG CGGTTGGCGG CCTGAGGAGA CGGCCACCAG GCCGACCGCT
GGGTACGAAG ACGACATGAC CAGCATGCGG CCGGTCGCTG GCGCCGCTCC GGCCGACGCT
ACATCCGCCA ACAAGGGCCC TCGGGTCGGC CACCGGAACC GCCGGCCCGT GCTCATCGGG
GCCGCGGCGG CGGCCACGCT GGTCGTGAGC CTCGGAGCCG GCGCCGCCGC ACTGTCCGAC
GGTGGCGACA CCGTCCCCAC CTCGGCACTC AAGGACATTG TGGCCACGAA TCCGACCTCG
TATGAGGGGG GCGTGACGTC GTCCTCGAGC AGTCCCAGCA CCGCGTGGAG CAACGCCTCA
CAGTCGCGCT CGGAAAACGC GCGCCGGAAG TCAGCCGCAT CGTCGCGTCA TTCCACCAGG
TCCCGCTTCG ACCAGGAACG CCACAGGGGC AGCTACCGCT CCAAGCCGTC ACATACCACC
ACCGCGCCCA GCCCGACTCG GGTGCCCACC ACCGCGCCCA GCCCCACTCG GGTGCCCACC
ACCGCGCCCA GCCCGACTCA GGTACCCACC ACCGCGCCCA GCCCGACTCA GGTACCCACC
ACCGCGCCCA GCCCCACTCA GGTACCCACC ACCGCACCCT CCGACGGCGG GGTGAGCACC
GAGGCCAGCG AGGTGGTCCG GCTGGTCAAC GCCGAGCGTG CGAAGGCTGG CTGCGCGGCA
CTGAGTATCG ACGAGAAGCT GATGACCGCC GCCCAGCGGC ACAGCCAGGA CCAGGCCGAC
CAGCAGAAGA TGTCGCACAC GGGCAGCAAC GGCAGTAGCC CCGGCGACCG GATCACCGCT
GTCGGCTACC AGTGGCGCAC CTACGGGGAA AACGTCGCCT GGAACCAGCA GTCCCCCGAA
GCCGTGATGA CCGCGTGGAT GAACAGCCCC GGCCACCGGG CGAATATCCT GAACTGCTCC
TTCACCGAGA TCGGGGTCGG CGTCGCGAGT AGCAACGGAC CGTACTGGAC ACAGGTCTTC
GCCACGCCTC GCTGA
 
Protein sequence
MEPDDNRRRP EPAADGWDRP ADEWDRRQQT AHRRAESETP QGWAAAGSSY HDQPSRGGQA 
APTWQHTDGW QQESAGGWRR QPTGWRDEPS GGGRHAAGAT GGPEVTGGWR PEETATRPTA
GYEDDMTSMR PVAGAAPADA TSANKGPRVG HRNRRPVLIG AAAAATLVVS LGAGAAALSD
GGDTVPTSAL KDIVATNPTS YEGGVTSSSS SPSTAWSNAS QSRSENARRK SAASSRHSTR
SRFDQERHRG SYRSKPSHTT TAPSPTRVPT TAPSPTRVPT TAPSPTQVPT TAPSPTQVPT
TAPSPTQVPT TAPSDGGVST EASEVVRLVN AERAKAGCAA LSIDEKLMTA AQRHSQDQAD
QQKMSHTGSN GSSPGDRITA VGYQWRTYGE NVAWNQQSPE AVMTAWMNSP GHRANILNCS
FTEIGVGVAS SNGPYWTQVF ATPR
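
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons); the helper names are illustrative only.

```python
from itertools import product


def clean(seq_block: str) -> str:
    """Remove the spaces and newlines used for 10-character grouping."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Codon table built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
head = clean("ATGGAACCGG ATGATAATCG CCGGCGACCC")
tail = clean("GCCACGCCTC GCTGA")
print(translate(head))  # MEPDDNRRRP
print(translate(tail))  # ATPR
```

Both fragments translate to the corresponding ends of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare with the GC content field.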