Gene Strop_3729 details

Gene Information

Locus tag: Strop_3729
Symbol:
ID: 5060207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4270993
End bp: 4273977
Gene Length: 2985 bp
Protein Length: 994 aa
Translation table: 11
GC content: 67%
IMG OID: 640475987
Product: hypothetical protein
Protein accession: YP_001160538
Protein GI: 145596241
COG category: [S] Function unknown
COG ID: [COG1615] Uncharacterized conserved protein
TIGRFAM ID:
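
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not encode a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4270993, 4273977)
print(length)                     # 2985
print(protein_length_aa(length))  # 994
```

Both results agree with the Gene Length and Protein Length fields of the record.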


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGGGTC 
CTGGTCGGGG TGTTCGTGCT CTTCACCCTG CTCGGTTGGG GGGTGCAGGC ATGGACCGAC
TGGCTCTGGT TCGGCGAGGT CGACTACACC GCGGTCTTCA GCGGGGTGCT CGTCACCCGG
CTGCTGCTCT TCGTCACGGT CGGTCTGGCC ATGGCGGTCA TCGTCGGCGG TAACCTCTGG
CTGGCGCATC GACTGCGGCC CCGGCTGCGA CCGCAGTCGC CGGAGCAGGC CACCCTGGAG
CGATACCGGA TGCTGCTCAG CCCCCGGATC GGCCTGTGGT TCGCGACGGT CTCCGTCGTG
GTGGGCCTCT TCGCGGGGCT GTCGGCGCAG AGCAGGTGGA GCGAGTGGCT GCTGTTCCGC
AACGGCGGGA ACTTCGGAGT CAAGGACCCG GAGTTCGGGG TGGACATCGG CTTCTACATC
TTTGACCTGC CGTTCTGGCG CTACCTGCTC GGGACCGCCT TCACCGCCGT GGTGTTGGCC
CTGCTCGGGG CGCTCGCTGT GCACTATGTC TTCGGCGGGA TCCGGCTTCA GGGCGTGGGT
GACCGGATGA GCACTGCGGC GCGGGCTCAC CTGAGCACGT TGGTCGCGGT CTTCGTGCTG
CTCAAGGCCG TCGCGTACGT CCTCGACCGG CGGACGATGC TGCTGGAGTA CAACGACGGC
GCCAACGTCT ACGGCGCCGG TTACGCCGAC ATCAACGCGT TGCTGCCGGC GAAGGAGATC
CTCGCCTACA TCTCGGTTGT CGTGGCGATC GCGGTCCTCG TCTTCTCCAA CGCCTGGATG
CGGAACCTGG TCTGGCCGGG CATCTCGCTG GCCCTGCTCG GGGTCTCCGC GGTCGCCATC
GGCGGCATCT ACCCGTGGGC GGTGCAGACC TTCGAGGTGA AGCCGAGTGC CCGGGACAAG
GAGGCGCGGT ACATCGAGCG CAGCATCGAG GCGACCCGGG CGGCCTTCAG TCTGGGCGAG
GCCGACGCCA CGCGGTACGC GGCGAGTAAC CTTCAGCCAC CGGCGAGCCT CGCCACCGAC
ACCGCGGTGG TACCGAATGC CCGACTGCTG GATCCGCAGC TGGTCAGCGA GACGTACACG
CAGCTTCAAC AGGTCCGCGG CTTCTACGAC TTCGGCCCCA AGCTCGACAT CGATCGCTAC
ACCGTCGATG GGGAGACCCA GGACTACGTG GTCGGGGTGC GTGAGATCAA CTATGGCGAG
CTGACCACGC AGCAGAGCAA CTGGATCAAC CGGCACACCG TCTACACCCA TGGTTATGGT
TTGGTCGCGG CCCCGGCGAA CCGGGTGGTC TGCGGTGGTC AGCCGTACTT CGTCTCCGGC
TTCCTCGGGG AGCGGTCGCA GGAGGGGTGT GCCGCGCAGA CCGACCAGAT ACCGGCCAGC
CAGCCGCGGA TCTACTACGG CGAGCGGATG GAGGCCGGCG ACTACGCCAT CGTCGGCAAG
GCAAACCCGG AGGCCAGTCC TGCCGAGTTC GATCGGCCGG TCGGCGAGGA CGGATCGGAG
TCCTACTACA CCTACACCGG CTCCGGCGGC GTCGAGGTCG GCTCGTTCGG TCGCCGGCTG
CTCTACGCGA TCAAGGAACA GGAGTCGAAC TTCCTGCTCT CCGAGGCGGT CAACGAGAAG
TCGAAGTTGC TCTACGTCCG TAACCCGCGG GAGCGGGTGG AGAAGGTGGC TCCGTTCCTC
ACCGTGGACG GCGACCCATA TCCGGCGGTG ATCGATGGCC GGGTGACGTG GATCATCGAT
GGCTACACCA CCGCCGCGAC CTACCCCTAC GCCGAGCGGA TCAACCTCCA GACCGAGACC
ACCGACGAGC TCACCAACCG GGGCACCTTC CAGCAGGCCC GGGAGAACAT CAACTACATC
CGTAACTCGG TCAAGGCGAC GGTCGACGCG TACGACGGCA CCGTCACCCT CTACGAGTTC
GACGACGGTG ACCCGGTGCT CCGGGCGTGG AACAAGGCCT TCGGCGGCGA CCTGATCAAG
CCGAAGGCGG AGATCCCGAC CGAGCTCAGT GCTCACTTCC GCTACCCGGC GGACCTGTTC
AAGGTGCAGC GGAACGTGTA CACCCGGTTC CACGTGACCA ACCCCGGTGA CTTCTACTCC
GGGCAGGACT TCTGGCAGGT GCCGAACGTG CCGGACGCAC CGGACAGCGG CCAGAAGCAG
CCCCCGTACT ACCTCTTCAC CCAGATGCCC GGGCAGGACG AGCCGCGTTT CCAGCTCACC
TCGGCGGTGA CCCCGAACCG ACGGCAGAAC CTCGCGGCGC TGATGTCCGG CTCGTACGTG
GACGGCAAGC CTCGGCTTGA GGTGTATGAG CTGCCGGAAG ACACCCGGAT CTCCGGTCCG
GTGCAGGTGC ACCAGCAGAT GACCAACAAC GCCCAGATCC GGCAGCAGCT GAACCTGCTC
TCGTCGAACC AGGCTCAGGT CCAGTACGGC AACCTGCTCT CCCTGCCGTT CGGAAACGGC
ATGCTCTACG TCGAGCCGGT CTATGTGAAG AGCAACCAAC AGCAGGCCTA TCCCCTGTTG
CAGAAGGTGC TGCTCTCCTA CGGCGACGGC GGTTCGTTCG TCGTCCTGGC CGACAACCTC
ACCGACGGCA TCAAACAGCT CGTCGAGCAG GGTGAACAGG CCGGCGCGCC ATCGCCTCCG
CCCTCCGACG ACGAGACGCC GCCGAGCCCA ACCCCAACCC CGACTCCGAC GACTCCGAGC
GTGACCCCAC CTCCGCTCAC GGGTGAGGTG GCTGAGGCGG CCCAGCGGGT TCAGGCGGCG
ATCGTGGAGC TCCGGGCCGC CCAGGAGTCC GGCGACTTCG AGCGCTACGG TCGGGCGTTG
CAGGCGTTGG ATGAGGCGAC CGCCGCCTTC GAGCAGGCAG CCGCGTCGAC CCCGGCTGCT
ACGCCGACCG CGGCACCCAC GGGTTCACCG TCGCCCGGAG GCTGA
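
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content field might be recomputed from such text; `clean_sequence` and `gc_percent` are hypothetical helper names, and the exact rounding used by the database is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGGGTC"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only, not of the whole gene
```

Passing the full gene sequence text to `clean_sequence` and then `gc_percent` would give the value to compare against the GC content field.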
 
Protein sequence
MRSSAPMPRM SRRGRVTIGV LVGVFVLFTL LGWGVQAWTD WLWFGEVDYT AVFSGVLVTR 
LLLFVTVGLA MAVIVGGNLW LAHRLRPRLR PQSPEQATLE RYRMLLSPRI GLWFATVSVV
VGLFAGLSAQ SRWSEWLLFR NGGNFGVKDP EFGVDIGFYI FDLPFWRYLL GTAFTAVVLA
LLGALAVHYV FGGIRLQGVG DRMSTAARAH LSTLVAVFVL LKAVAYVLDR RTMLLEYNDG
ANVYGAGYAD INALLPAKEI LAYISVVVAI AVLVFSNAWM RNLVWPGISL ALLGVSAVAI
GGIYPWAVQT FEVKPSARDK EARYIERSIE ATRAAFSLGE ADATRYAASN LQPPASLATD
TAVVPNARLL DPQLVSETYT QLQQVRGFYD FGPKLDIDRY TVDGETQDYV VGVREINYGE
LTTQQSNWIN RHTVYTHGYG LVAAPANRVV CGGQPYFVSG FLGERSQEGC AAQTDQIPAS
QPRIYYGERM EAGDYAIVGK ANPEASPAEF DRPVGEDGSE SYYTYTGSGG VEVGSFGRRL
LYAIKEQESN FLLSEAVNEK SKLLYVRNPR ERVEKVAPFL TVDGDPYPAV IDGRVTWIID
GYTTAATYPY AERINLQTET TDELTNRGTF QQARENINYI RNSVKATVDA YDGTVTLYEF
DDGDPVLRAW NKAFGGDLIK PKAEIPTELS AHFRYPADLF KVQRNVYTRF HVTNPGDFYS
GQDFWQVPNV PDAPDSGQKQ PPYYLFTQMP GQDEPRFQLT SAVTPNRRQN LAALMSGSYV
DGKPRLEVYE LPEDTRISGP VQVHQQMTNN AQIRQQLNLL SSNQAQVQYG NLLSLPFGNG
MLYVEPVYVK SNQQQAYPLL QKVLLSYGDG GSFVVLADNL TDGIKQLVEQ GEQAGAPSPP
PSDDETPPSP TPTPTPTTPS VTPPPLTGEV AEAAQRVQAA IVELRAAQES GDFERYGRAL
QALDEATAAF EQAAASTPAA TPTAAPTGSP SPGG
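
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table by hand rather than relying on any library; it assumes the sequence is in frame from its first base and stops at the first stop codon. Table 11 assigns the same amino acids to codons as the standard table and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace in the input (block separators, newlines) is ignored.
    """
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence from this record.
print(translate("ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGGGTC"))
# MRSSAPMPRMSRRGRVTIGV
```

The output matches the first twenty residues of the protein sequence listed above.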