Gene Strop_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1703 
Symbol 
ID5058162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1948599 
End bp1951703 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content68% 
IMG OID640473975 
Productpeptidase M14, carboxypeptidase A 
Protein accessionYP_001158545 
Protein GI145594248 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.504762 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGCA CACGGCTGGC GATTGCCGGC GTGTTTACCT TGGTCGGCGC GTTGGCACTG 
ACCGCACCGG CAACCGCACA ACCGGCGTCC GAACCGGGCA GCCGGGACAG TCTGGAGGTG
TACGTCGGCA CGGTTGACCC GGAGCAGCTG GAGAAGCTGC GGCACGCCGG GGTCGACCTC
GGCCACGAGC ACACCGAGAC CGACCGGTCC GGCGACATCC GCGTCGAGAC GGTTCTCAGC
AAACGGCAGG CGGCCCGGTT GGCCAGCCAG GGCGTGCAGC TGGAGGTCAA GAAGGTACGG
GGCAAGCCGG CCAGCGAGGC GCTGCGGGAG CAGGCCGCCA CCGGCTGGTC CGCGTTCCGG
TCCTACAGCG AGCCGGGCGG AATTCGAGAC GAGATCACCG CCACCGCCGC CCGCTACCCG
GAGCTGACGA AGGTGATGAC GATCGGCCGC AGCCACCAGG GCAAGCCGAT CCTCGCCGTC
AAGGTGACCA AGAACGCGGA GAAGACCCGC GACGGCAAAC GGCCGGCGGT GCTCTACGCC
AGCACGCAGC ACGCCCGCGA GTGGATCACA CCGGAGATGA CTCGGCGGCT GATGCACCAC
GTGCTCGACA ACTACGGCAC GGACCGGGAC ATAACCCGGC TGGTGGACAC CACCGAGTTG
TGGTTCCTAC CCGTCGCCAA CCCGGACGGC TACGACCACA CCTTCACGCC CGGTAACCGG
CTCTGGCGCA AGAATCTTCG GGACAACGAC GGCGACGGGC AGATCACCAC CGCCGACGGC
GTCGACCTGA ACCGCAACTT CGGCTACAAG TGGGGATACG ACAACGAGGG GTCCTCCCCC
GACCCGATCA GCAACACCTA CCGGGGGCCC AGCCCGCACT CGGAGCCCGA GACCCGGGCG
CTGGACCAGC TGTTCCGACG GGTCGGCTTC GAGTTCTTCG TGAACTACCA CTCCGCCGCC
CAACTGCTGC TCTACGGCGT GGGCTGGCAG GTCGCCACCC CCACCCCGGA CGACATCATC
TACGAGGCGA TGGTCGGTGA CGACGAGAAC CCGGCCGTAC CCGGCTACGA CCCGGACATC
TCCGCCGAGC TGTACACCAC CAACGGCGAC ACCGACACGC ACGCCACGGT CCGCTACGGC
ACCCTCGGCT TCACCCCGGA GATGTCGACC TGCCAGGCCG CGGCGGCCTC CGACCCGGAT
GACGAATGGC TACCGGAGGA CTGTGTCAGC GGCTTCATCT TCCCCGACGA CGAAAAATTG
ATCTCCGCGG AGGTGGCGAA GAACCTGCCG TTCGCCCTCG CCGTGGCGCA GTCGGCCCAC
GACCCGGACG AGCCGGTGTC GGTCGTGGGC CGCAGCACCC CGGACTTCGT GGTGGACAGC
TTCGACACGT CCTACGGCCG CAACCAGCAG GTCGCCTCGA TCACCCGACG AGCGTTGCGC
AACGTCCGGA TGCACTACAC GATCAACGAC GGCCGGACCA AGACCGTCAG CGTCCGCGAG
TGGCGGGGCG GTGAGCGCTA CGGCGACACC CACGACGACT ACTACGCCGA GCTGCGGGGC
ACAGTGCAGG GTGCCAGACC CGGTGACCAG GTCGAGGTGT GGTTCAGCGG CAAGAAACCA
AAAGCAGGGA AGGTGACCAG CGAGCGCTTC ACCTACCAGG TGCACGACGA CGTCGGCGGC
GATGTCCTGG TGCTGGCGAT GGAGGATGTC ACCGGGCCGA GCCCGGAACA GGACGCCACC
AGCGCCAAGT ACGCCGACGA GATGACCGAC GCGCTGACCA CGGCCGGTCG CACCAGCGAC
GTGTACGACT TTGACACGAT GGGTCGCCGG GCCCCGCACC ACCTGGGTGT GCTGTCGCAC
TACCAGATGG TGCTCTGGGA GACCGGCGAC GACATCATTC CGCGTTCTCC GGGGCAGGTA
CCGGGCACCA TCGCCCGGGC GGCGGTGGAG ACCGAGCTGG CCGTCCGGGA CTACCTGAAC
GAGGGCGGCA AGCTCCTGCT CAGCGGCGAG TACGCGCTCT TCGCCCAAGC CGCCAACGGC
GCGTACGGCT ACCAGCCGAA CGGTCCGGCG GAGTGCACCG ACCCGGGCGA CGAGACGTGC
CTGCCGGCGC TCAACGACTT CCAGCAGTAC TGGCTGGGAG CCTTCACCTA CGTCAGTGAC
GGTGGCACCG GCGAGACCGG GCCGTACCCG GTGGTCGGCA CCGACGACCG GTTCACCGGC
TTCGACGGCG CACTGAACGC ACCGGGCTCG GCGGAGAACC AGGACCACAC GGCGTCGTTC
CTGACCACGT CGGGCTTCCT CCCGCCGGAT CAGTTCCCGC AGTTCGACAG CTCGGCACCG
CTGGGCTGGG AGCGGCCGGG GGGCGCGCCG TTCGACCCGC GTACCGGTGA CTGGTACCTG
TGGAGCGACC AGACCAACGA GTCGTACAAG CGGCTCACCC AGACCGTTGA CCTCAGCGGC
GCCAACGCCG CTGAGCTGCG CTTCTTCGCC TCGTACGACA TCGAGGCGAA CTGGGACTAC
CTGATCGTCG AGGCGCACGA GGTGGGTAGC GACAGCTGGA CGACCCTGCC GGACGCCAAC
GGCCGGACCA GCACCGACAC CGGGGACAGC TGCGAGTCGG GCTGGGTCGA GGCGCTGCAC
CCGTGGCTCG CCCGGTACCA GAGCGAGGAC TGCTCGCCGA CGGGCACCAC CGGCAGGTGG
AACGCCGTCA CCGGTGCCTC CAACGGCTGG CAGGAGTTCG CGGTCGACCT ATCCGGGTAC
GCCGGTAAGG AGGTGGAGGT GTCGATCTCG TACATCTCGG ACTGGAGCAC CCAGGGTCTG
GGCGTCTTCC TCGACGATGC CCGAGTGATC GTGGACAACG CCACGGTCAG CGACACCTCG
TTCGAGACGG ACCTGGGTGA CTGGGCGCTG GCCGGCCCCC CACCCGGCTC GGCTGAGACC
GCAGAAGACT GGTCGCGCAG CCAGCAGGCG TTCGAGGAGG GATCGGCTGT GGTCACCGCG
GACACCGTCT ACCTGGGCTT CGGCCTGGAA GGGTTGGCTC CGGCCGACCG GGCCGAGCTG
GTCCGGCGGA CGATGGACCA CCTGTTCGCG GCAGATCGCT CGTAG
 
Protein sequence
MRRTRLAIAG VFTLVGALAL TAPATAQPAS EPGSRDSLEV YVGTVDPEQL EKLRHAGVDL 
GHEHTETDRS GDIRVETVLS KRQAARLASQ GVQLEVKKVR GKPASEALRE QAATGWSAFR
SYSEPGGIRD EITATAARYP ELTKVMTIGR SHQGKPILAV KVTKNAEKTR DGKRPAVLYA
STQHAREWIT PEMTRRLMHH VLDNYGTDRD ITRLVDTTEL WFLPVANPDG YDHTFTPGNR
LWRKNLRDND GDGQITTADG VDLNRNFGYK WGYDNEGSSP DPISNTYRGP SPHSEPETRA
LDQLFRRVGF EFFVNYHSAA QLLLYGVGWQ VATPTPDDII YEAMVGDDEN PAVPGYDPDI
SAELYTTNGD TDTHATVRYG TLGFTPEMST CQAAAASDPD DEWLPEDCVS GFIFPDDEKL
ISAEVAKNLP FALAVAQSAH DPDEPVSVVG RSTPDFVVDS FDTSYGRNQQ VASITRRALR
NVRMHYTIND GRTKTVSVRE WRGGERYGDT HDDYYAELRG TVQGARPGDQ VEVWFSGKKP
KAGKVTSERF TYQVHDDVGG DVLVLAMEDV TGPSPEQDAT SAKYADEMTD ALTTAGRTSD
VYDFDTMGRR APHHLGVLSH YQMVLWETGD DIIPRSPGQV PGTIARAAVE TELAVRDYLN
EGGKLLLSGE YALFAQAANG AYGYQPNGPA ECTDPGDETC LPALNDFQQY WLGAFTYVSD
GGTGETGPYP VVGTDDRFTG FDGALNAPGS AENQDHTASF LTTSGFLPPD QFPQFDSSAP
LGWERPGGAP FDPRTGDWYL WSDQTNESYK RLTQTVDLSG ANAAELRFFA SYDIEANWDY
LIVEAHEVGS DSWTTLPDAN GRTSTDTGDS CESGWVEALH PWLARYQSED CSPTGTTGRW
NAVTGASNGW QEFAVDLSGY AGKEVEVSIS YISDWSTQGL GVFLDDARVI VDNATVSDTS
FETDLGDWAL AGPPPGSAET AEDWSRSQQA FEEGSAVVTA DTVYLGFGLE GLAPADRAEL
VRRTMDHLFA ADRS