Gene Sare_3080 details


Gene Information

Locus tag: Sare_3080
Symbol:
ID: 5706851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3487147
End bp: 3488343
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 70%
IMG OID: 641272517
Product: hypothetical protein
Protein accession: YP_001537885
Protein GI: 159038632
COG category:
COG ID:
TIGRFAM ID:
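
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and serve only as illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3487147, 3488343)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length and Protein Length fields of the record.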


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000201414
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGACGCCAA CTACGTCGGC AAGCTCGAAC GGGGCGAGCA CCGCTGGCCC CGCGACTCCC 
GGCGCGAAGC CTTCCGCGCC GTGCTCGGAG CGGTCAGCGA TGCCGCCCTC GGCTTCTACG
TCGTCCGCGG CGAACCGCAC CAGGCCGCCG ACCACGCGCA GATCGAGGTC GACCGGCTGG
AGACGGCTGA CGGGGGTAGG CGTGCGCTGC TCGGCGGATT TGCCGGCCTG GTCGCTGCCC
TCGGCCTGTT CGGTTGGGGA CCGCGTGGCG ACAAGAGCGC CCGGATCGGC GGCTCGGACG
TGGCCCGCCT CAACGCCGTC GTGGCGCTGT ACCGATCGGT GGACTACGAG TCCGGCGGCG
GCGTTGCGCG TCGAGGCGGG TCGGTTCGCC GAGGCGGCGT CGTCGCTGTC CGATCGACCC
TGCAACGACA CGGTCAAGCC CGCCCTACTG GCCGCCATCG CCAACGCCCG CCAGCTCGCC
GGTTGGGCCG CCTTCGACAC CGGCCACCAC TCCGACGCCC AACGCCACTG GCTATCGGCC
GAACGCACCG CCGTCGCCGC AAGCGACCTG CGACTAGCCG CCCGCGTGCG CTACTGCCAG
GCCCGACAGT TCCAGCACCT ACACCACAAC GGCGACGCCC TGGACACGCT GCGACTGGCC
CACGACCACC TCGCCGGCCG CGCCACCCCG GCAATCAACG CCATGCTGCA CGGCGCCGAG
GCCGCCTCCC TCGCGGCCAG AGGCGATCGA CAAGAGGCCC TGACCGCGCT CGGCGCCGCC
ACCGACGCCT TCGACCGCAT CGACCCCGAC TGCGAACCGG AGTGGATGCG CTTCTACGAC
CGCGGCGAGC TGCTCGCCCA ATACGGACGC GTCCACCGCG ACCTCGCCCG TAGCGACGAA
CGACACGGCA ACGCCGCCGT TCAATGGGTC ACCGAGGCCA TCGCCGCATT CGGCCCCCAA
AATGTACGCA GCACGGTACT CAACGAAGTC GGACTGTGTA GCGGCCTCTT CCTCGCCGGA
GAACCACAGG AAGCCGTCAT CATCGGCACC CGGGTTATCC AGCACTCCAA CCAGTTAACC
TCCCAGCGGG TACACGACCG CATCCGCAAC CTCCGCCGCG ACATGCATCG GTACGCAACC
GACCCGGAGG TCGCCGAGTT CAGCCGAACC CTGTCCACGA TCGGCTCGGG CACATGA
 
Protein sequence
MTPTTSASSN GASTAGPATP GAKPSAPCSE RSAMPPSAST SSAANRTRPP TTRRSRSTGW 
RRLTGVGVRC SADLPAWSLP SACSVGDRVA TRAPGSAART WPASTPSWRC TDRWTTSPAA
ALRVEAGRFA EAASSLSDRP CNDTVKPALL AAIANARQLA GWAAFDTGHH SDAQRHWLSA
ERTAVAASDL RLAARVRYCQ ARQFQHLHHN GDALDTLRLA HDHLAGRATP AINAMLHGAE
AASLAARGDR QEALTALGAA TDAFDRIDPD CEPEWMRFYD RGELLAQYGR VHRDLARSDE
RHGNAAVQWV TEAIAAFGPQ NVRSTVLNEV GLCSGLFLAG EPQEAVIIGT RVIQHSNQLT
SQRVHDRIRN LRRDMHRYAT DPEVAEFSRT LSTIGSGT
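
The protein sequence follows from the gene sequence under translation table 11, where an alternative start codon such as the TTG that opens this gene is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the method used to produce this record. The function name `translate_cds` is hypothetical, and the input is assumed to be a complete coding sequence given on its coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 permits as initiators; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between base groups."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"   # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()      # drop the terminal stop codon
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGACGCCAA CTACGTCGGC AAGCTCGAAC"))  # MTPTTSASSN
```

The printed residues match the first ten residues of the protein sequence listed above.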