Gene Sare_0941 details


Gene Information

Locus tag: Sare_0941
Symbol:
ID: 5706866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1065453
End bp: 1066871
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 69%
IMG OID: 641270459
Product: hypothetical protein
Protein accession: YP_001535847
Protein GI: 159036594
COG category:
COG ID:
TIGRFAM ID:
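
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence on this page ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1065453, 1066871)
print(length, protein_length_aa(length))  # prints "1419 472"
```

Under these assumptions the result matches the listed gene length of 1419 bp and protein length of 472 aa.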


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.15053
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.447115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGACA CCCGTGACCG ACTCCACGCA CACCAGTTCG TGTCTGACCG GCTGGTTTCT 
GCGCTGGTGG GCGGCGACCC TGATGTGTGG GAACGGCCGA TGCGCCGAAG CCGGGTGGGG
GGCATGATCG GCCTCGTCCT CGCCCTGTTG GCCACCGCCG GTTTCGGCGT CTACGGCATG
GTCGCCGGTG GTCGCAGCAC CGCATGGCAG GAACCTGGTT CGATCATTGT TGAACGAGAG
ACCGGCACCC GTTACCTCTA CCTTGACGAC ATGCTGCGGC CGGTGCTCAA CTACAGCTCG
GCACTGCTGG CGGTCTCCGA CGACCGTGCG ACAGTACGGA TCGTCTCGCG GGCCTCGCTG
GCGACAGTGC CGCACGGGCG TCCGATCGGC ATACCCGGAG CCCCTGACGC CCTGCCAGAC
AAGGACAGCC TGAGTAGCGG CCCCTGGCTG GTGTGCACCG CCGGAACGGA TCCGGTGCCA
GGTCTGCCTG ATCGGCCGGG ACTGGTCGTG AGCCTCGACC GGGGTGGGCC GGTGGAACGG
GTACCCGAAG AACACGGAGT GCTGGTCAGC GCTGGTGGGC AGACGTACCT GGCGTGGCGA
GATCGGCGCA TGCGGATCCG GGACCATCGT GCGTTGGTGG CGCTGGGCTA TCGGAGCGTG
ACACCGATGC CGGTGTCGGC CGCCTGGCTC AACGCGTTGG CCGCCGGCCC TGACCTCGTC
GCGCCGGAGG TCACCGGGCA CGGTGCGGCG GGGGTGCCAC TCGGTGGCAA GTCCACCAAG
GTGGGGCAGG TCTTCACGGT ACGAATGGCC GGCGACACCG AAGAGTACTA CCTGATGCGC
CGTGACGGTC TGGCCCCGAT CACCGCCACC CAGGTGGCGC TCCTGTTGGC CGACCAACGC
ATAGCACAGC TCAATCCCGA TGGCCCTCGG CCGATCAATG CGGCCGCCGT GGCGCTCGCG
CCCGGGGCGA CGGTGGCAGT CTCGGGAAAG CACCCCCGTA CGCCGCCCCG TCCGGTGGAA
GGGTTGGACG GAGCTAGGGG CCTGTGTGTC GAGTTGTCCT TCGACGGAGA TCGGGGAGCG
GCGGGGCGGC TGGTGACTGT GCCGGCGGGT CGGGTAGCCG CGGCACGGCT TGTCGCCACT
GGTCCACCTG AGGATGGTCG GCTGGCTGAT CGGGTGCAGG TGCCGCCCAG CGGTGGGGCG
TTGGTGGTGG GGCAGTCAGC ACCGGGGGTG GAATCCGGCA CGCTGTATTT GATTAACGAG
ATCGGGGTGA AGTATCCCCT CGCGGGAGAG GAGGTCGTCG CCGCTCTCGG ATACGCGCAG
GCCCCGCGGG TGCCGGTGCC GACCACGGTG TTGGCGATGT TCACCACCGG CCCGGCCCTG
AATCCGCAGG CGGCCCGAAC GGAGGGTTCG TCACCCTGA
 
Protein sequence
MPDTRDRLHA HQFVSDRLVS ALVGGDPDVW ERPMRRSRVG GMIGLVLALL ATAGFGVYGM 
VAGGRSTAWQ EPGSIIVERE TGTRYLYLDD MLRPVLNYSS ALLAVSDDRA TVRIVSRASL
ATVPHGRPIG IPGAPDALPD KDSLSSGPWL VCTAGTDPVP GLPDRPGLVV SLDRGGPVER
VPEEHGVLVS AGGQTYLAWR DRRMRIRDHR ALVALGYRSV TPMPVSAAWL NALAAGPDLV
APEVTGHGAA GVPLGGKSTK VGQVFTVRMA GDTEEYYLMR RDGLAPITAT QVALLLADQR
IAQLNPDGPR PINAAAVALA PGATVAVSGK HPRTPPRPVE GLDGARGLCV ELSFDGDRGA
AGRLVTVPAG RVAAARLVAT GPPEDGRLAD RVQVPPSGGA LVVGQSAPGV ESGTLYLINE
IGVKYPLAGE EVVAALGYAQ APRVPVPTTV LAMFTTGPAL NPQAARTEGS SP