Gene Sare_1179 details

Gene Information

Locus tag: Sare_1179
Symbol:
ID: 5703530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1326711
End bp: 1328108
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 72%
IMG OID: 641270697
Product: SCP-like extracellular
Protein accession: YP_001536078
Protein GI: 159036825
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
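
The coordinates and lengths in this record are mutually consistent, which a short check makes explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1326711
end_bp = 1328108

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1398
print(protein_length)   # 465
```

Both results agree with the listed Gene Length (1398 bp) and Protein Length (465 aa).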


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.249096
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000057053
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTACGGCT GGAACGACCC GAGGAACCCG GACGGTACCC ACCGGCAGCC CGAACCGACA 
GCCAACCAAC CCGCCTGGCT GACCGACCGA CCAGAGCCAC GCTCCTCGTA CCTGTTCGGC
GACGAACCGG AGCAGCCGGC CGACGAACCG AAACGACCCA CCGACAGCTG GGGGCAACCG
ACCGACGGGT CGACCCGGCA CACCGACAGC TGGCACCGGC CCACCGACGG GTGGAGCCAA
CAGCAGCCAG CCACACACCG GCAGCCGGAG GCCGGAACAC AGAGCTGGGG CAGCGCGGAC
GACCCGTACC GCGGACCCGC GGGTGGTAGG TACCCCGAGC AGCCCACGCC GAGCTGGGGA
GCCAACCCCA CCTGGCAGCA GGGCGGGCCC ACCGACCCCT GGCGGCAGGA ACACCCGAGC
GACGGGCACC GCCAGCAGCC CGGGGGCTGG CAAAACGAAC CCACCGGTGA GTGGCACCGC
GGTGCCACTC CCAGCGGTGA GTGGCACCGC GCGGCCACCC CCACCGGCGG ACCCGAGCAC
ACCGACCGGT GGCGGCCCCA TCAGACGACC GGCGGATACG CCGACACGCC GACGACCCGT
ATGCCACCCG TCGTGGGCCC ATCGCCGACT GCCGACGTGC CGACCGGGGA TGGACCGCCC
GCCGGGCGCC GGAACCGGCG GCCCCTGTTC ATCGGGGGCG CGGCGGCGGC GGCCACACTG
GTGGTGAGCC TCGGGGTCGG TGCCGTCACC CTCATCGGTG GCGACGACAC CAGGCCCACC
TCGGCCGCCG AGGACATCGT GGCAACGAAC CCGACATTCG GCGAAAGCTC AGCGCCGGCC
ACCCCCACCA CCACGGCCAG CCCCAGCGCG ACCCCGGCGT CACCGTCACC GTCGGCCACA
CCGAGTCGGA AGCCGGCACC GGCACCGTCG CGCTCCACCG CCGCGTCCCG TCCCACTCCG
GCCCGGACCA CCACGTCCCC CGCCGGCAGC AACTCCACGC CCCCCAGCGG CAACATCAGT
GCCGACGCCG CCAAGGTGGT CAGCCTGGTC AACGCCGAAC GCGCGAAGGC CGGCTGCAAG
GCTCTGAGCG TCGACGACAA ACTGATGACC GCCGCCCAGC GGCACAGCCA GGACCAGGCC
GACAACAAGA AGATGACGCA CACGGGCAGC AACGGCAGCA CCCTCGGCGA CCGGGTCAAG
GCCGTCGGCT ACCGGTTCCG CGCCGCCGGG GAGAACGTCG CCTGGAACCA GCAGTCACCC
GAGGCCGTGA TGAACGCGTG GATGAACAGC TCCGGCCACC GGGCGAACAT CCTGAACTGC
TCCTTCACCG AAATCGGGGT CGGCATCGCG AGCAGCAACG GACCGTACTG GACACAGGTC
TTCGCCGCGC CGCTCTGA
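
The gene sequence above is printed in blocks of ten bases separated by spaces, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from it. The helper names `clean_sequence` and `gc_content` are hypothetical, and it is an assumption that the database reports GC content as the plain fraction of G and C bases over the gene length.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Example on the first block of the gene sequence: 6 of 10 bases are G or C.
print(gc_content("GTGTACGGCT"))  # 0.6
```

Pasting the full gene sequence into `gc_content` should give a value close to the listed 72% if the assumption about how the figure is computed holds.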
 
Protein sequence
MYGWNDPRNP DGTHRQPEPT ANQPAWLTDR PEPRSSYLFG DEPEQPADEP KRPTDSWGQP 
TDGSTRHTDS WHRPTDGWSQ QQPATHRQPE AGTQSWGSAD DPYRGPAGGR YPEQPTPSWG
ANPTWQQGGP TDPWRQEHPS DGHRQQPGGW QNEPTGEWHR GATPSGEWHR AATPTGGPEH
TDRWRPHQTT GGYADTPTTR MPPVVGPSPT ADVPTGDGPP AGRRNRRPLF IGGAAAAATL
VVSLGVGAVT LIGGDDTRPT SAAEDIVATN PTFGESSAPA TPTTTASPSA TPASPSPSAT
PSRKPAPAPS RSTAASRPTP ARTTTSPAGS NSTPPSGNIS ADAAKVVSLV NAERAKAGCK
ALSVDDKLMT AAQRHSQDQA DNKKMTHTGS NGSTLGDRVK AVGYRFRAAG ENVAWNQQSP
EAVMNAWMNS SGHRANILNC SFTEIGVGIA SSNGPYWTQV FAAPL
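
The protein begins with M although the gene begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial code), GTG can act as a start codon and is then read as methionine; the codon-to-amino-acid assignments are otherwise those of the standard code. The sketch below illustrates this with a hypothetical `translate_cds` helper. As a simplifying assumption it treats only ATG, GTG and TTG as start codons, which is a subset of those that table 11 allows.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(raw: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("GTGTACGGCT GGAACGACCC GAGGAACCCG"))  # MYGWNDPRNP
```

The same function applied to the final bases of the gene, `TTCGCCGCGC CGCTCTGA`, yields `FAAPL`, matching the end of the protein sequence, with TGA read as the stop codon.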