Gene Information |
Locus tag | Strop_3729 |
Symbol | |
ID | 5060207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4270993 |
End bp | 4273977 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640475987 |
Product | hypothetical protein |
Protein accession | YP_001160538 |
Protein GI | 145596241 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGCA GCGCCCCCAT GCCGAGGATG AGCCGACGCG GACGCGTCAC GATTGGGGTC CTGGTCGGGG TGTTCGTGCT CTTCACCCTG CTCGGTTGGG GGGTGCAGGC ATGGACCGAC TGGCTCTGGT TCGGCGAGGT CGACTACACC GCGGTCTTCA GCGGGGTGCT CGTCACCCGG CTGCTGCTCT TCGTCACGGT CGGTCTGGCC ATGGCGGTCA TCGTCGGCGG TAACCTCTGG CTGGCGCATC GACTGCGGCC CCGGCTGCGA CCGCAGTCGC CGGAGCAGGC CACCCTGGAG CGATACCGGA TGCTGCTCAG CCCCCGGATC GGCCTGTGGT TCGCGACGGT CTCCGTCGTG GTGGGCCTCT TCGCGGGGCT GTCGGCGCAG AGCAGGTGGA GCGAGTGGCT GCTGTTCCGC AACGGCGGGA ACTTCGGAGT CAAGGACCCG GAGTTCGGGG TGGACATCGG CTTCTACATC TTTGACCTGC CGTTCTGGCG CTACCTGCTC GGGACCGCCT TCACCGCCGT GGTGTTGGCC CTGCTCGGGG CGCTCGCTGT GCACTATGTC TTCGGCGGGA TCCGGCTTCA GGGCGTGGGT GACCGGATGA GCACTGCGGC GCGGGCTCAC CTGAGCACGT TGGTCGCGGT CTTCGTGCTG CTCAAGGCCG TCGCGTACGT CCTCGACCGG CGGACGATGC TGCTGGAGTA CAACGACGGC GCCAACGTCT ACGGCGCCGG TTACGCCGAC ATCAACGCGT TGCTGCCGGC GAAGGAGATC CTCGCCTACA TCTCGGTTGT CGTGGCGATC GCGGTCCTCG TCTTCTCCAA CGCCTGGATG CGGAACCTGG TCTGGCCGGG CATCTCGCTG GCCCTGCTCG GGGTCTCCGC GGTCGCCATC GGCGGCATCT ACCCGTGGGC GGTGCAGACC TTCGAGGTGA AGCCGAGTGC CCGGGACAAG GAGGCGCGGT ACATCGAGCG CAGCATCGAG GCGACCCGGG CGGCCTTCAG TCTGGGCGAG GCCGACGCCA CGCGGTACGC GGCGAGTAAC CTTCAGCCAC CGGCGAGCCT CGCCACCGAC ACCGCGGTGG TACCGAATGC CCGACTGCTG GATCCGCAGC TGGTCAGCGA GACGTACACG CAGCTTCAAC AGGTCCGCGG CTTCTACGAC TTCGGCCCCA AGCTCGACAT CGATCGCTAC ACCGTCGATG GGGAGACCCA GGACTACGTG GTCGGGGTGC GTGAGATCAA CTATGGCGAG CTGACCACGC AGCAGAGCAA CTGGATCAAC CGGCACACCG TCTACACCCA TGGTTATGGT TTGGTCGCGG CCCCGGCGAA CCGGGTGGTC TGCGGTGGTC AGCCGTACTT CGTCTCCGGC TTCCTCGGGG AGCGGTCGCA GGAGGGGTGT GCCGCGCAGA CCGACCAGAT ACCGGCCAGC CAGCCGCGGA TCTACTACGG CGAGCGGATG GAGGCCGGCG ACTACGCCAT CGTCGGCAAG GCAAACCCGG AGGCCAGTCC TGCCGAGTTC GATCGGCCGG TCGGCGAGGA CGGATCGGAG TCCTACTACA CCTACACCGG CTCCGGCGGC GTCGAGGTCG GCTCGTTCGG TCGCCGGCTG CTCTACGCGA TCAAGGAACA GGAGTCGAAC TTCCTGCTCT CCGAGGCGGT CAACGAGAAG TCGAAGTTGC TCTACGTCCG TAACCCGCGG GAGCGGGTGG AGAAGGTGGC TCCGTTCCTC ACCGTGGACG GCGACCCATA TCCGGCGGTG ATCGATGGCC GGGTGACGTG GATCATCGAT 
GGCTACACCA CCGCCGCGAC CTACCCCTAC GCCGAGCGGA TCAACCTCCA GACCGAGACC ACCGACGAGC TCACCAACCG GGGCACCTTC CAGCAGGCCC GGGAGAACAT CAACTACATC CGTAACTCGG TCAAGGCGAC GGTCGACGCG TACGACGGCA CCGTCACCCT CTACGAGTTC GACGACGGTG ACCCGGTGCT CCGGGCGTGG AACAAGGCCT TCGGCGGCGA CCTGATCAAG CCGAAGGCGG AGATCCCGAC CGAGCTCAGT GCTCACTTCC GCTACCCGGC GGACCTGTTC AAGGTGCAGC GGAACGTGTA CACCCGGTTC CACGTGACCA ACCCCGGTGA CTTCTACTCC GGGCAGGACT TCTGGCAGGT GCCGAACGTG CCGGACGCAC CGGACAGCGG CCAGAAGCAG CCCCCGTACT ACCTCTTCAC CCAGATGCCC GGGCAGGACG AGCCGCGTTT CCAGCTCACC TCGGCGGTGA CCCCGAACCG ACGGCAGAAC CTCGCGGCGC TGATGTCCGG CTCGTACGTG GACGGCAAGC CTCGGCTTGA GGTGTATGAG CTGCCGGAAG ACACCCGGAT CTCCGGTCCG GTGCAGGTGC ACCAGCAGAT GACCAACAAC GCCCAGATCC GGCAGCAGCT GAACCTGCTC TCGTCGAACC AGGCTCAGGT CCAGTACGGC AACCTGCTCT CCCTGCCGTT CGGAAACGGC ATGCTCTACG TCGAGCCGGT CTATGTGAAG AGCAACCAAC AGCAGGCCTA TCCCCTGTTG CAGAAGGTGC TGCTCTCCTA CGGCGACGGC GGTTCGTTCG TCGTCCTGGC CGACAACCTC ACCGACGGCA TCAAACAGCT CGTCGAGCAG GGTGAACAGG CCGGCGCGCC ATCGCCTCCG CCCTCCGACG ACGAGACGCC GCCGAGCCCA ACCCCAACCC CGACTCCGAC GACTCCGAGC GTGACCCCAC CTCCGCTCAC GGGTGAGGTG GCTGAGGCGG CCCAGCGGGT TCAGGCGGCG ATCGTGGAGC TCCGGGCCGC CCAGGAGTCC GGCGACTTCG AGCGCTACGG TCGGGCGTTG CAGGCGTTGG ATGAGGCGAC CGCCGCCTTC GAGCAGGCAG CCGCGTCGAC CCCGGCTGCT ACGCCGACCG CGGCACCCAC GGGTTCACCG TCGCCCGGAG GCTGA
|
Protein sequence | MRSSAPMPRM SRRGRVTIGV LVGVFVLFTL LGWGVQAWTD WLWFGEVDYT AVFSGVLVTR LLLFVTVGLA MAVIVGGNLW LAHRLRPRLR PQSPEQATLE RYRMLLSPRI GLWFATVSVV VGLFAGLSAQ SRWSEWLLFR NGGNFGVKDP EFGVDIGFYI FDLPFWRYLL GTAFTAVVLA LLGALAVHYV FGGIRLQGVG DRMSTAARAH LSTLVAVFVL LKAVAYVLDR RTMLLEYNDG ANVYGAGYAD INALLPAKEI LAYISVVVAI AVLVFSNAWM RNLVWPGISL ALLGVSAVAI GGIYPWAVQT FEVKPSARDK EARYIERSIE ATRAAFSLGE ADATRYAASN LQPPASLATD TAVVPNARLL DPQLVSETYT QLQQVRGFYD FGPKLDIDRY TVDGETQDYV VGVREINYGE LTTQQSNWIN RHTVYTHGYG LVAAPANRVV CGGQPYFVSG FLGERSQEGC AAQTDQIPAS QPRIYYGERM EAGDYAIVGK ANPEASPAEF DRPVGEDGSE SYYTYTGSGG VEVGSFGRRL LYAIKEQESN FLLSEAVNEK SKLLYVRNPR ERVEKVAPFL TVDGDPYPAV IDGRVTWIID GYTTAATYPY AERINLQTET TDELTNRGTF QQARENINYI RNSVKATVDA YDGTVTLYEF DDGDPVLRAW NKAFGGDLIK PKAEIPTELS AHFRYPADLF KVQRNVYTRF HVTNPGDFYS GQDFWQVPNV PDAPDSGQKQ PPYYLFTQMP GQDEPRFQLT SAVTPNRRQN LAALMSGSYV DGKPRLEVYE LPEDTRISGP VQVHQQMTNN AQIRQQLNLL SSNQAQVQYG NLLSLPFGNG MLYVEPVYVK SNQQQAYPLL QKVLLSYGDG GSFVVLADNL TDGIKQLVEQ GEQAGAPSPP PSDDETPPSP TPTPTPTTPS VTPPPLTGEV AEAAQRVQAA IVELRAAQES GDFERYGRAL QALDEATAAF EQAAASTPAA TPTAAPTGSP SPGG
|
| |
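The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4270993
END_BP = 4273977
GENE_LENGTH_BP = 2985
PROTEIN_LENGTH_AA = 994


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value with the value stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed in this record: the span is 2985 bp, and 2985 / 3 - 1 gives 994 aa.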