Gene Sare_4839 details

Gene Information

Locus tag: Sare_4839
Symbol:
ID: 5707744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5488888
End bp: 5490252
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 71%
IMG OID: 641274235
Product: hypothetical protein
Protein accession: YP_001539580
Protein GI: 159040327
COG category:
COG ID:
TIGRFAM ID:
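
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5488888
END_BP = 5490252


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp):
    """Residue count, assuming the last codon is an uncounted stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp, protein_length_aa(length_bp))  # prints: 1365 454
```

Under those assumptions the result agrees with the listed Gene Length (1365 bp) and Protein Length (454 aa).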


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000509999
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCAGACCC AACGCGACCA CGTCCACGCC CACACCTTCA TGATGGGCCG GCTGAGCTCG 
GCCCTGGTGT TGGGTGACCC GACCGGGGCC GAGATTCCCG GCCGCCGCGC GCAGACCGGC
CTGCTGATCG GGATCATCCT GGCGCTGCTG GTGTCCGGCG GCTTCGCCGT GTACGGGTGG
ATAGTCCCCG GAGGAAGCAC CGCCTATCGG CAGGCCGGGG CGATCCTGGT GGAGAAGGAG
ACCGGCAACC GCTACGTCTA CCTCAACGGG CTGCTGCACC CCACCCCGGA CCTGACCTCG
GCGATGCTGA TCCAGGGCCC CTCCGCCGAG GTCACGTTGA TTTCGAAGAA CTCCCTCCGG
GATGTGGCAC GTGGTGCCCC GCTCGGGTTG GCCGGCGCGC CCAGGCAGCT ACCGTCGACC
GACGGGTTCG TACGGGGGCC GTGGCTCGCC TGCCTGCCCG GCTCTGTCGC CCCCGGTCGA
TCCGTGTCCG GGCTGGGCAT CAACCTCGAT CCCGGGGTAG CCGCCGACCC GTTGCCGGCG
GACCGGTTCG TCGTCGTACG TGACCAGCGT GACGTGGCCT ATCTGCTCGC CAACGGGGTG
AAGTATCGGG TCGACGACGA GGCGGTGCTG GTGGTGTTGG GTGCCGCCAC CGTCAGCCCG
GCCCCGGCCC CGCAGTTGTG GCTGGATTGG CTGGACGACG GGCCCGCCCT GGCGCCCGCC
CGGATCGAGG GCGCCGGTGC TCCCGGTCTG CAGGTGGGTG GGCGTGCCCA CCCCGTCGGG
ACGCTCTTTC GTCAGCGGGT GGAGTCCGGC TCCGAGCAGT TCTTCGTGCT GCGCCGGGAT
GGGCTCGCAC CGATGAGCCG GACGGAGTTC CTGCTGGCCG ACGCCAAGGA CGAGGACGCT
GCGGTCGAGC TGAACCCGGC GGCGATCGTC GACGCTCGGC GCTCCGCCGA CCGCTCGCTG
CTGGACCGGT TGCCCGACCT CACGCCGCTG CGGCTGCTGG ACACCGCCGG ACGTGCCCTG
TGTGCGCGGC AACGCCCGGT CTCGGCCGAG GAGTACGCCA GCGAGGTGGT GCTGGTACCG
CAGCCGGCAG CCGCCATGAG CGCGGACGGC ACGCCGCTCG TGCTGACCCG TCCCGGGGCC
GGGATGCACG TGGTCGCCGC CCCCGTGCCG GCGCAGACCG CCACCGCACA CACCTTCGTC
ATCTCCGACG ACGGCATCGC CTACCGTCTC GCGGACCAGG CCACGAGGTC CGCGTTGAAG
CTGGGCACGG TCGCGCCCAT ACCGTTTCCG AAGGACCTGT TGGCGGCAAT GCCGCAGGGA
GCCGTGCTGA GTCGTGAGGC TGTCACAAGC CTGCCGAGGG GGTAG
 
Protein sequence
MQTQRDHVHA HTFMMGRLSS ALVLGDPTGA EIPGRRAQTG LLIGIILALL VSGGFAVYGW 
IVPGGSTAYR QAGAILVEKE TGNRYVYLNG LLHPTPDLTS AMLIQGPSAE VTLISKNSLR
DVARGAPLGL AGAPRQLPST DGFVRGPWLA CLPGSVAPGR SVSGLGINLD PGVAADPLPA
DRFVVVRDQR DVAYLLANGV KYRVDDEAVL VVLGAATVSP APAPQLWLDW LDDGPALAPA
RIEGAGAPGL QVGGRAHPVG TLFRQRVESG SEQFFVLRRD GLAPMSRTEF LLADAKDEDA
AVELNPAAIV DARRSADRSL LDRLPDLTPL RLLDTAGRAL CARQRPVSAE EYASEVVLVP
QPAAAMSADG TPLVLTRPGA GMHVVAAPVP AQTATAHTFV ISDDGIAYRL ADQATRSALK
LGTVAPIPFP KDLLAAMPQG AVLSREAVTS LPRG
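
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above (both are in frame).
first_line = "ATGCAGACCC AACGCGACCA CGTCCACGCC CACACCTTCA TGATGGGCCG GCTGAGCTCG"
last_line = "GCCGTGCTGA GTCGTGAGGC TGTCACAAGC CTGCCGAGGG GGTAG"

print(translate(first_line))  # MQTQRDHVHAHTFMMGRLSS
print(translate(last_line))   # AVLSREAVTSLPRG
```

Both outputs match the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural next check.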