Gene Sare_1701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1701 
Symbol 
ID5704012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1964344 
End bp1967448 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content68% 
IMG OID641271204 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_001536579 
Protein GI159037326 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00157811 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGCGCA CACGACTGGC GATCGCCGGC GTGTTCACCC TGGTCGGCGC GCTGGCGTTG 
ACCACACCGG CAAACGCACG GCCGTCGTCC GAACCGGACA GTCGGGATAG TCTGGAAGTA
TATGTCGGCA CGGTGGATCC GGAGCAGTTG GAGAAGCTCC GGCACGCCGG GGCCGACCTC
GACCACGGAC ACACGGAGAC CGACCGGTCC GGCGACACTC ACGTCGAGAC GGTGCTCAGC
AAACGGCAGG CGGCCCGCCT GGCCAGCCAG GGCGTACGGC TGGAGGTCAA GAAGGTACGG
GGCAAGGCGG CCAGCCAGGT CCTCCGCGAG CAGGCCGCCA CCGGCTGGTC GGCGTTCCGG
TCCTACAGCG AGCCTGGCGG TATCCGGGAT GAGATCACCG CCACCGCCGC CCGCTATCCG
AAGTTGACGA AGGTGACGAC GATCGGCCGT AGCCACCAGG GCCAGCCGAT CCTCGCCGTC
AAGGTGACCA AGAACGCGAA GCGGATCCGC GACGGCAAAC GGCCGTCGGT GCTGTACGCC
AGTGCGCAGC ACGCCCGCGA GTGGATCACG CCGGAGATGA CCCGGCGGCT GATGCACCAC
GTGCTCGACA ACTACGGCGA GGACCAGGAC ATCACCCGGC TGGTGGACAC CACGGAGCTG
TGGTTCGTGC CGGTCGCCAA CCCGGACGGG TACGACCACA CGTTCACGCC CGGCAACCGC
CTCTGGCGCA AGAATCTGCG GGACAACGAC GGTGACGGGC AGATCACCAC CGCTGACGGT
GTCGACTTGA ACCGCAACTT CGGCTACAAG TGGGGGTACG ACAACGAGGG CTCCTCCCCC
GACCCGATCA GCAACACCTA CCGGGGACCG AGCCCACACT CGGAGCCGGA GACCCGGGCG
CTGGACAAGC TGTTCCGCAA GGTCGGCTTC GAGTTCTTCG TGAACTACCA CTCGGCCGCC
GAACTGCTGC TTTACGGCGC GGGCTGGCAG GTCGCCACCC CCACCCCGGA CGACATCATC
TACGAGGCGA TGGTCGGTGA CGACGACAAC CCGGCCGTGC CCGGCTACGA CCCGGACCTC
TCCGCCGAGC TGTACACCAC CAACGGCGAC ACCGACACGC ACGCCACGGT CCGCTACGGC
ACCCTCGGCT TCACGCCAGA AATGTCGACC TGCCAGGCCG CGGCGGCCTC TGACCCGGAC
GACGAATGGC TACCGGAGGA CTGTGCCAGC CGCTTCATCT TCCCCGACGA CGAAAAGCTG
ATCTCCGCGG AGGTGGCGAA GAACCTGCCG TTCGCCCTCG CCGTGGCACA GTCGGCGCAC
GACCCGGACG AGCCGGTGTC GGTCGTCGGC CTCAGCACCC CAGACTTCGT GGTGGACACC
TTTGACACGT CGTACGGCGG CAGACAGCAG GTCGCCTCGA TCACCCGCCG AGCGTTACGG
AACGTCCGGA TGCACTACAC GATCAATGAC GGCCGGACCA AGACCGTCAG CGTCCGGGAG
TGGCACGGCG GTGAGCGCTA CGGCGACACC CACGACGACT ACTACGCCGA GCTGCGGGGC
ACGGTACAGG GCGCCCGACC CGGCGATCAG GTCGAGGTGT GGTTCAGCGG CCGGAAACCC
GGAACGGGGA AAGTGACCAG CGAGCGCTTC ACGTACCAGG TGCACGACGA CATCGGCGGC
GACGTCCTGG TCCTGGCGAT GGAGGACGTC ACCGGGCTGA GCCCAGTGCA GGATGCCACC
AGCGCGAAGT ACGCCGACGA GATGGCCGCG GCGCTGACCG CGGCCGGGCG TACCAGCGAC
GTGTACGACT TCGACACGAT GGGCCGCCAG GCCCCGCACC ATCTGGGTGT GCTGTCGCAC
TACCGCATGG TGCTCTGGGA GACCGGTGAC GACGTGATCC CGCGTTCCCA AGGGCAGGTG
CCGGGCACCA TCGCCCGGGC GGCCGTGGAG ACCGAGCTGG CCGTCCGCGA CTACCTGAAC
GAGGGTGGCA AGCTCCTGAT CAGCGGCGAG TACGCGCTGT TCGCCCAGGC CGCCAACGGT
GCGTACGGCT ACCAGCCGAA CGGGCCGGCG GAATGCACCG ACCCAGGTGA CGAGACCTGC
CTGCCGGTGC TGAACGACTT CCAGCAGTAC TGGCTGGGCG CGTTCACCTA CGTCAGCGAC
GGCGGCACCG GTGACAGCGG GCCGTACCCG GTGATCGGCG AGGACAACCG GTTCGCCGGC
TTCACCGGCG CACTGAACGC GCCGGGCTCG GCGGAGAACC AGGACCACAC GGCGTCGTTC
CTGACCACGT CGGGCTTCCT TCCGCCGGAC GAGTTCCCGC AGTTCGACAG CTCGGCACCG
CTGGGCTGGG ATCGGCCGGG GGGCGCGCCG TTCGACCCGC GTACCGGCGA CTGGTACCTG
TGGAGCGGGC AGGCCAACGA GTCGTACAAA CGGCTCACCC AGACCGTTGA CCTCAGCAAC
GCCAGCGCCG GTGAGCTGCG CTTCTTCACC TCGTACGACA TCGAGTCGAA CTGGGACTAC
CTGATCGTCG AAGCACACGA GGTGGGCAGC GACGCCTGGA CGGCCCTTCC GGACGCCAAC
GGTAGGACCG GCACCGACAC CGGGGACAGC TGCGATTCGG GCTGGGTCGA GGCGCTGCAC
CCGTGGCTCG CCCGGTACCA GGGGGAGGAC TGCTCGCCGA CGGGCACCAC CGGCAGTTGG
CACGCCGCCA CCGGCGCCTC CAACGGCTGG CAGGAGTTCG TGATCGACCT GTCCGGATAC
GCCGGCAAAC AGGTCGAGGT GTCGATCTCG TATATCTCGG ACTGGAGCAC CCAGGGCCTG
GGCGTGTTCC TCGACGACGC CCGGGTGATC GTGGACGGCA CCACGGTCAG CGAGACCTCG
TTCGAAGCGG ACCTGGGTGA CTGGACGCTG GCCGGTCCGC CGCCTGGCTC GGCAGACAAC
ACGACCGACT GGGCTCGTAG TCAGCAGGCG TTCGAGGAGG GGTCGGCGGT GGTTACCCCG
GACACGGTAT ACCTCGGCTT CGGCCTGGAG GGGCTGGCTC CTGCTGCCCG TGCCGACCTG
GTCGAGCGGA CGTTGGACCA TCTGTTCGAG GCGGACCGCG GCTGA
 
Protein sequence
MRRTRLAIAG VFTLVGALAL TTPANARPSS EPDSRDSLEV YVGTVDPEQL EKLRHAGADL 
DHGHTETDRS GDTHVETVLS KRQAARLASQ GVRLEVKKVR GKAASQVLRE QAATGWSAFR
SYSEPGGIRD EITATAARYP KLTKVTTIGR SHQGQPILAV KVTKNAKRIR DGKRPSVLYA
SAQHAREWIT PEMTRRLMHH VLDNYGEDQD ITRLVDTTEL WFVPVANPDG YDHTFTPGNR
LWRKNLRDND GDGQITTADG VDLNRNFGYK WGYDNEGSSP DPISNTYRGP SPHSEPETRA
LDKLFRKVGF EFFVNYHSAA ELLLYGAGWQ VATPTPDDII YEAMVGDDDN PAVPGYDPDL
SAELYTTNGD TDTHATVRYG TLGFTPEMST CQAAAASDPD DEWLPEDCAS RFIFPDDEKL
ISAEVAKNLP FALAVAQSAH DPDEPVSVVG LSTPDFVVDT FDTSYGGRQQ VASITRRALR
NVRMHYTIND GRTKTVSVRE WHGGERYGDT HDDYYAELRG TVQGARPGDQ VEVWFSGRKP
GTGKVTSERF TYQVHDDIGG DVLVLAMEDV TGLSPVQDAT SAKYADEMAA ALTAAGRTSD
VYDFDTMGRQ APHHLGVLSH YRMVLWETGD DVIPRSQGQV PGTIARAAVE TELAVRDYLN
EGGKLLISGE YALFAQAANG AYGYQPNGPA ECTDPGDETC LPVLNDFQQY WLGAFTYVSD
GGTGDSGPYP VIGEDNRFAG FTGALNAPGS AENQDHTASF LTTSGFLPPD EFPQFDSSAP
LGWDRPGGAP FDPRTGDWYL WSGQANESYK RLTQTVDLSN ASAGELRFFT SYDIESNWDY
LIVEAHEVGS DAWTALPDAN GRTGTDTGDS CDSGWVEALH PWLARYQGED CSPTGTTGSW
HAATGASNGW QEFVIDLSGY AGKQVEVSIS YISDWSTQGL GVFLDDARVI VDGTTVSETS
FEADLGDWTL AGPPPGSADN TTDWARSQQA FEEGSAVVTP DTVYLGFGLE GLAPAARADL
VERTLDHLFE ADRG