Gene Strop_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0994 
Symbol 
ID5057440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1111234 
End bp1112421 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content69% 
IMG OID640473264 
Producttransposase, IS4 family protein 
Protein accessionYP_001157847 
Protein GI145593550 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGATCAT CATCGCTGAT TGATGTGTTG GCCGTGCACG CGGACCACCT GGATGAGTCG 
TTGCCGGTCC AACGCCGGTC GAGCCTGGTC ACGGCGTTGG CGGCAGTGCC GGACCGGCGA
GACCCCCGCG GTGTCGTCCA TGCGTTGCCC GCGGTCCTGG CCACTGCCGT CGCGGCGGTG
CTGACCGGTG CCCGCTCCGC GGCGGCGGTC GCGGAATGGG CCGCTGATGC GCCGCAGCAG
GTCCTCACTG AACTGGGCGT GTTCCGGGAT CCGTTCACCG GGGTGCATCG AGCTCCGGAC
GAGTCCACGT TCCGGCGAAT CCTGGCCGGT GTCGACGCCG ACGCCCTGGA CGACACGGTC
GGTCGGTGGG TCCTCGCGTG CCAGCCGGCG GCGACCACCG GGCGACGGGT GTACAGCGTG
GACGGCAAAA CCTTGCGGGG CAGCGGCCCG GCCGGTGAAC AGGTGCATCT CCTCGCAGTG
CTCGACCAGC ACACCGGGAC GGTCCTGGGC CAGGTCGACG TCGACGGCAA GACCAACGAG
CTGACCCGCT TCCAGCCGCT GCTGGGCCCT CTCGACCTGA CCGCGGTGGT CGTCACCGCC
GACGCGTTAC ACACCCAGCG CGAGCATGCC CGCTGGCTGG TCGACACCAA GAAGGCCGCC
TACGTCTTCA CCGTGAAGAA GAACCAGCCG CGTCTGTATC GGCAGCTCAA GACCCTGCCC
TGGACGAAGA TCCCGATCCA GGACGAGACC AGCACCCGCG GGCACGGCCG CTACGACATC
CGCCGCCTGC AAGCAGTCAC CTGCACCGGG CCACTCGCCC TGGACTTTCC CCACGCCGTA
CAAGCACTAC GGATCCGCCG CCGACGGCTG AACCTGGCCA CTGGCCGCTG GTCCACCGTC
ACCGTCTACG CGATCACCAA CCTGAGCGCA GCCCAGGCCG GCCCTGCCGA ACTGGCCGAC
TGGCTGCGCG GGCACTGGGC CATCGAAACT CTGCACCACA TCCGCGACAC CACCTACGCC
GAGGACGCCA GCCGCCTACG CACCGGCAAC GCACCCCGCG CCATGGCCAC TCTGCGCAAC
ACGGCGATCA ACCTGCTCCG GCTAACCGGC ATCACCACCA TCGCTGCAGC CCTACGCCAC
AACAGCCGAA ACCCATACCG GCCACTCCAA CTACTCGGAC TCGACTGA
 
Protein sequence
MGSSSLIDVL AVHADHLDES LPVQRRSSLV TALAAVPDRR DPRGVVHALP AVLATAVAAV 
LTGARSAAAV AEWAADAPQQ VLTELGVFRD PFTGVHRAPD ESTFRRILAG VDADALDDTV
GRWVLACQPA ATTGRRVYSV DGKTLRGSGP AGEQVHLLAV LDQHTGTVLG QVDVDGKTNE
LTRFQPLLGP LDLTAVVVTA DALHTQREHA RWLVDTKKAA YVFTVKKNQP RLYRQLKTLP
WTKIPIQDET STRGHGRYDI RRLQAVTCTG PLALDFPHAV QALRIRRRRL NLATGRWSTV
TVYAITNLSA AQAGPAELAD WLRGHWAIET LHHIRDTTYA EDASRLRTGN APRAMATLRN
TAINLLRLTG ITTIAAALRH NSRNPYRPLQ LLGLD