Gene Sare_0449 details


Gene Information

Locus tag: Sare_0449
Symbol:
ID: 5705319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 515046
End bp: 516224
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 71%
IMG OID: 641269974
Product: hypothetical protein
Protein accession: YP_001535369
Protein GI: 159036116
COG category:
COG ID:
TIGRFAM ID:
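
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and it assumes the coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA).

```python
# Values copied from the Gene Information fields of this record.
start_bp = 515046
end_bp = 516224
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: start and end are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add an amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

coordinates_consistent = span_bp == gene_length_bp
whole_codons = gene_length_bp % 3 == 0
protein_consistent = expected_protein_aa == protein_length_aa

print(span_bp, codon_count, expected_protein_aa)
```

Under these assumptions the three checks agree with the record: 516224 - 515046 + 1 = 1179 bp, and 1179 / 3 - 1 = 392 aa.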


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.114768
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000434601
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCGGAAC CGCTGCACGC CGCGCTCGCT GAGGTACGGG CGCTGCTGCT CGCACCCGGG 
CTCAGCCGGG CGGTGGGCGC CGGACACCGC CGCGGGCTTC GCCCGACGGT GCACCGCGCG
GAGCTACGGC CGGTCACCCT CAAATCCGGC CCCCGACTCC AGATCACCAC CTCGGACGGG
AGTAGGCCGC ACACCCGCAA CGTCGGTTGG GACGGGGAAG CGGACGCGGC GGTGGACGCA
CTGTTGGCCG AACCGTTCGG CAACTGGCAT GTGGAGACCG CGGAGACGAC CCTGCAACTG
CGGGTCACGA AGTCCGGTGC GGCACAGGTA CACCGTGCGG CGGCCCAACC GGTAGCCGAG
CCGGCCGCCC ACGACCGGAC GAAGGCCCAC CTACTCGATC CCGGTGACCC GATCTTCACC
GTGATCGGTG CGTCGGCAGC CAAACGGCGA CAGGTGGACG CCTTTCTGCG GGCGCTCGCC
GCAACGCTCC CGGACGATCT CGCCGGTCCG CTGCACGTCG TCGACCTGGG TTGCGGAAAC
GCGTACCTGA CCTTCGCCGC GTACCACTGG TTGACCCAAC GGGGCCTCGA CGTCCACCTG
ATCGGTGTCG ACGTACGCGA GGACCAGCGC CAACGCAACA CCGAGTTGGC CCGGCGGCTG
GGTTGGACCG ACCGGGTGCG CTTCGTCGCG GGCACGATCG CCGACGCCCC GGTCGGGTCC
GCCCCCGATC TGGTGCTGGC CCTGCACGCC TGCGACACCG CCACCGACGA GGCGCTGGCA
CGGGCGGTGC GGTGGAGGTC TCGCTGGGTG CTCGCGGCGC CGTGCTGCCA CCACGACATC
GCCGCGCAAC TGCGCTCCAG GCCAACTCCG CCCCCATATG AACTACTGAC TCGGCAGGGC
ATCCTCCGCG AGCGGTTCGC GGACGTGCTC ACCGATGCGG TCCGGGCAGG ACTGTTGCGC
CTACACGGCT ACCGGGCCGA GGTGGTCGAG TTCGTCGACT CCCGGCACAC ACCCCGGAAC
CTGCTCATCC GGGCCCGACG TACCGGGGCG ATCCCCACCG GTGAGCGCTG GACGGAGTAC
CGGACCCTGG TGGATGGATG GAGGGTGACC CCGAGGCTGG CGATGCTGCT CGACGAGCCA
CCCGCTGGGA CGTCCACCGG TGCGGCCGTA GCCGACTGA
 
Protein sequence
MPEPLHAALA EVRALLLAPG LSRAVGAGHR RGLRPTVHRA ELRPVTLKSG PRLQITTSDG 
SRPHTRNVGW DGEADAAVDA LLAEPFGNWH VETAETTLQL RVTKSGAAQV HRAAAQPVAE
PAAHDRTKAH LLDPGDPIFT VIGASAAKRR QVDAFLRALA ATLPDDLAGP LHVVDLGCGN
AYLTFAAYHW LTQRGLDVHL IGVDVREDQR QRNTELARRL GWTDRVRFVA GTIADAPVGS
APDLVLALHA CDTATDEALA RAVRWRSRWV LAAPCCHHDI AAQLRSRPTP PPYELLTRQG
ILRERFADVL TDAVRAGLLR LHGYRAEVVE FVDSRHTPRN LLIRARRTGA IPTGERWTEY
RTLVDGWRVT PRLAMLLDEP PAGTSTGAAV AD
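
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is not the tool that produced the record; `translate`, `first_row`, and `last_codons` are hypothetical names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and its last nine bases, copied from the record.
first_row = "ATGCCGGAAC CGCTGCACGC CGCGCTCGCT GAGGTACGGG CGCTGCTGCT CGCACCCGGG"
last_codons = "GCCGACTGA"

print(translate(first_row))
print(translate(last_codons))
```

The first row translates to MPEPLHAALAEVRALLLAPG, which matches the first 20 residues of the protein sequence, and the last nine bases give AD followed by the TGA stop codon, which matches its final two residues.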