Gene Sare_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0038 
Symbol 
ID5707318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp46934 
End bp48370 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content74% 
IMG OID641269563 
Producthypothetical protein 
Protein accessionYP_001534965 
Protein GI159035712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGAC GCGGCTGGGG AACATCGATC ACGACCGCGC TTGGCACCGC TGCCGGAGCG 
GGTGCAGCTC AACTGGGCTT CGGCTACGGC CTGGGCATCA TCAACTGGGC GCCACCACCC
GACGAGAGCA CCGCCGCCAC CGCCTGGACC GCCAGTCTCA TCTGGGCCAT GTGGATCGCC
GCCACGTCCA CTGTGGTTGG CGCCGTCGGT GCCCAACACC TGCGCCGCCG CAGCAGAGCT
GGCGACCACG CAACCAGCAA CCGCAGCGGC AGCGGCAGCG GCGAGGCCGA AGCGACGTCG
GCAACCACCA GCCCCGGCGA TCTCGACCCC GCCGCGACCA CGGAGACAGG GGCGCGACAC
CTCTCGGTGG ACGCGAGCGA CACCGACGGT GCACTGGGCA GGCTGGCACT CGCCGCGGCA
GCCGGATTCG GTGCGCTGGT CACGGTGCTG CTGACCGCCG TGCCGGCACG GGTCGCCGTG
GTGCCCGGCG TCACCGCACC CCGGGACGTC GCCGCAGGGT ACGCCACAGT GGGCGTACTG
GTCGGGGTGG CCATTGCTGT ATGGGCACTG CACTCCCGCG CTGCCGCCGG CAACGTGATC
GCGACCTTGG GCTGGCTGTG GCTGCTCGCC GTGGTGGCCG TTGTCGACGG TGTCGTCGCG
GGGCGTGGGC TCAGCAGCGC CCAGCTTGGT ATCTGGCAGC TCAGCGCCGG CGGGGAGGGG
CTGTGGCTCC GCGACTGGTT CTACTGGCCG GGCGCAGTGC TGTCACTCGG TTCCGCCCTA
CTCATCGGCG TACTGGTCGC CCGCCGTGCA CCCAGGCACC CCGACCGCCA GGTGGGTGCC
ACCGCCTCCG GGGCGGCCGG CCCGCTCCTG GTCGCGGTTG CCTACCTGGT CGCCGTGCCG
GACCTGGCCG AACTCGCTGC CGGACAGGCG TCGGCGCACC TCATCGCCCC GTACGCGGTC
ATCGTCGGTT TCGGGGGCTC GGCACTGGTC ACGGCGCTCG GCAACCGGGC CGACCGCCGG
ACGCGGGCCT ACCCGCCCCG ACCGGTGGAG TCCCACGCCG GCCCAGGCAC CGACGGTTCG
ACCACCGCGA CGCCCACCAC TCGCGGTCGA GTGCGCGGAT CGGGCAGTCG GCGCTCGCGC
ACCGTGAAGT CGGACCCGAC CGCCGAAGCG GGTGAGCCCA CGGCGGCCAG CGACGCTACC
GGCGCGTCCG CCCCCGACGA CCAGCCGAGC GACGCATCCG GTCGCCCCTC GGCCCCCGGG
GGCGCCCGTC GGGGCCGGGG CACCGCCGCC CGGTCGGGCT CCGCGGGCGG GGATCCGACC
ACCGAGGTGC CGGTTCAGCG CACCGCAGAA GCCGCACCAG CGGACGCGAC CACCGCGTCG
AAGGAGTCGA CGGACGGGGG CTCACGCCGG GCGCGGTCCA CCCGCCGCAC CAGCTGA
 
Protein sequence
MARRGWGTSI TTALGTAAGA GAAQLGFGYG LGIINWAPPP DESTAATAWT ASLIWAMWIA 
ATSTVVGAVG AQHLRRRSRA GDHATSNRSG SGSGEAEATS ATTSPGDLDP AATTETGARH
LSVDASDTDG ALGRLALAAA AGFGALVTVL LTAVPARVAV VPGVTAPRDV AAGYATVGVL
VGVAIAVWAL HSRAAAGNVI ATLGWLWLLA VVAVVDGVVA GRGLSSAQLG IWQLSAGGEG
LWLRDWFYWP GAVLSLGSAL LIGVLVARRA PRHPDRQVGA TASGAAGPLL VAVAYLVAVP
DLAELAAGQA SAHLIAPYAV IVGFGGSALV TALGNRADRR TRAYPPRPVE SHAGPGTDGS
TTATPTTRGR VRGSGSRRSR TVKSDPTAEA GEPTAASDAT GASAPDDQPS DASGRPSAPG
GARRGRGTAA RSGSAGGDPT TEVPVQRTAE AAPADATTAS KESTDGGSRR ARSTRRTS