Gene Strop_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3031 
Symbol 
ID5059495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3462296 
End bp3463726 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content72% 
IMG OID640475281 
Productmembrane-flanked domain-containing protein 
Protein accessionYP_001159846 
Protein GI145595549 
COG category[S] Function unknown 
COG ID[COG3428] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.687058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC CGGCCGCGCG AGACCCGGAG GAGACCCGGC CACCCGCCCC ACAGCCGTAC 
CCCGGTTGGT ATCCACCCAC CCCGCCACCG CCCACCCCTC CGGTGCCCCC GCACGGAGGC
GAGCGCCGGC AACGGCTCCA TCCGCTCTCC CCTGCGCTGC ACGGTGCCAA GTCGCTGGTC
GTGGTAATCG CCGGGCTCTC CTGGTCAACG TTGTCCCGGG TCGGCTTCGG CTGGTTCGCG
GTGATGGCGG TGGTGCTGGC GATCGGTGCC ACCGTGCTCG CGGTGGTGAG CTGGTACAAC
ACCGGCTATC ACGTCGTGGA CCGCGAGCTG CGGGTACACG AGGGCCTGCT CTGGCGACGC
ATCCGGGCGA TCCCGCTGGA GCGGCTCCAG GCCGTGGAGG TGGTTCGGCC GTTACTCGCC
CAGCTCACCG GATTGGCGGA GCTGCGCCTC GAGGTGGTCG GCGGCGGCAA GACCGAGGCA
CCGCTGGCCT ATCTCAGCGT GGCCGACGCT GCCGCCCTGC GACAGCGGCT GCTCGCGTTG
GCCGGTCCGC AACCGGAGGG CGCCGCCCTG ACACCGGGCA CACCGCAGGT CTGGCCGGAA
GCGACTCCCC CTGGCGAACC GCTGCACGTC GTCCGGAACA CAGACCTGTT ACTGAGCCAG
CTACTCACCC CGCAGACCTT CATGATCCCG TTCGGGGTGG TCTTCGTGGC GACGCAGTTC
CTCTCCGGGG ATTCCTGGTC GTTCGTCGCG GTGGCGAGCA CGTTGACCGC GATGGCCGGG
GTGCTGCTAC AGCCGATCCG CCGAGTGCTG GACAACTGGA GCTTCCGGCT GGCTCGTGAT
GTCGACACCC TGCGGATCCG CAACGGCCTG CTGGAGACCC GGGCGCAGAC CGTCCCGCTG
CACCGGGTAC AGACGGTCGG TGCGACCTGG CCGCTGCTGT GGCGGGCGAA GGGCTGGCTG
CGGCTACGGC TGGAGGTGGC CGGCTACTCG ACGGGGGAAG CTGACGGGCG CAACCGACCA
GACCAGCTCC TCCCGGTCGG TCAGCAACCG GTCGCCGAGG CGATCCTCGC CGAGGTGCTG
CCCGGGGTAC GCCTCGACGA CCTGCCGCTT ACCGCACCGC CGACCCGGGC TCGCTGGCTG
AACCCCCTGA GTCAGCCAGT GCTCGGTGCC GGGCTGCACG AGCGAGTGTT CGCGGTCCGC
TCCGGCCTGC TCACCCGCCA GATCGTGGTC GTGCCGTACG CGAGGATCCA GAGCGTGCGG
GTGGTGCAGG GGCCGGTGCA GCGACGGTTG GGACTGGCGA CGGTGCACGC GGACACCGCC
GGCGGCGCCG GGGCCGCCGC CGACGACCGA GCGGTCGCCG AGGCGTGGGC CCTGGCCGCC
GAGCTGACAG CGCGTGCGCA CCAGGCGCGC CGCGCCAGCC GACCGCAGTA G
 
Protein sequence
MSEPAARDPE ETRPPAPQPY PGWYPPTPPP PTPPVPPHGG ERRQRLHPLS PALHGAKSLV 
VVIAGLSWST LSRVGFGWFA VMAVVLAIGA TVLAVVSWYN TGYHVVDREL RVHEGLLWRR
IRAIPLERLQ AVEVVRPLLA QLTGLAELRL EVVGGGKTEA PLAYLSVADA AALRQRLLAL
AGPQPEGAAL TPGTPQVWPE ATPPGEPLHV VRNTDLLLSQ LLTPQTFMIP FGVVFVATQF
LSGDSWSFVA VASTLTAMAG VLLQPIRRVL DNWSFRLARD VDTLRIRNGL LETRAQTVPL
HRVQTVGATW PLLWRAKGWL RLRLEVAGYS TGEADGRNRP DQLLPVGQQP VAEAILAEVL
PGVRLDDLPL TAPPTRARWL NPLSQPVLGA GLHERVFAVR SGLLTRQIVV VPYARIQSVR
VVQGPVQRRL GLATVHADTA GGAGAAADDR AVAEAWALAA ELTARAHQAR RASRPQ