Gene Sare_4200 details


Gene Information

Locus tag: Sare_4200
Symbol:
ID: 5704200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4767459
End bp: 4768727
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 76%
IMG OID: 641273619
Product: hypothetical protein
Protein accession: YP_001538972
Protein GI: 159039719
COG category:
COG ID:
TIGRFAM ID:
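
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Coordinates taken from the record above.
length = gene_length_bp(4767459, 4768727)
print(length, protein_length_aa(length))
```

With the values in this record the two results agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).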


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0938646
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00891245
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCATCCG TCACCCCCGA CCAGCCCGGC CGTCGCGCCC GGACGGGCGC ACGTCCGCCT 
GGGCGACCTG GGCCGGCTCG CCGGGTACCC CCGCCACGGG CGGGCGGGCC GCCGAGCCGT
CCGCCGCTGG CCGTCGCCGC AGGGGCCGCC GCTGTCGGAG CCGCACTCAC CTCGTGGCTG
CCGGTAGCGG TGGTGCTCTG GTTCTTCCAG CTCAGTGAGA GTGCGGCGAC ACTGTTGGGA
GTGGTCCGGA TCGGGCTGGC CGGTTGGCTG CTCGGACACG GCGTACCCCT GGTGACGGAC
GTCGGTCCAC TCGGGCTCGC TCCGCTGGCC GTGACCCTGC TCGCCGGCTG GCGCCTCACC
CGTGCCGGGG TGCACGTCAC CCGAGCTATC GGCGCGCGTG GCAGCCGTTC GCCGAAGCGC
GCGGTCCTCG CCGCGGGCGC GGTCGGGTTC GCGTACGCCG TCCTTGGAGT GTCCGCTGCC
CTGCTGGTCA CCACCGGGGA ACCGGCCGTG TCGCCGGTGC GGGCCGGCCT GACCCTCGCC
GTGGTGGGAA CGGTCGCCGC CCTGGCCGGC GCAGTTCGTA CGACCGGACT CGGTGACCAG
TTCGCCGAGC GGGCGCCACT ACCGCTGCGG GAGGGGATCC GCACCGGCCT GGTCGCGGCC
CTGTTGCTGC TCGCGGCGGG GGCTGGCATG GCGGGGCTGG CGGTGGCGAC CGGCGGCGGT
GATGCGGCCG ACCTGATCGG GAAGTACCAC ACCGGGGTGG CCGGGCAGGC CGGGATCACC
CTGGTCAACC TGGCGTATGC CCCGAACGCG GCGGTCTGGT CGACCAGCTA CCTGCTCGGT
CCCGGGTTCG CGGTCGGCAC CGACACCACG GTACGGACCA GCGAGGTGAC GGTGGGGGCG
TTGCCGGCCC TACCGTTGGT CGCCGGCCTC CCCGGTGGGC CGGCAGACGG CCTCGGCGCG
GGTCTGCTTG CGGTGCCGGT CCTGGTCGGG ATGGTGGCGG GCTGGCTGTT GACCCGCCGG
GTGTTGCGGC TCGTCGACGA GGGCGCCCGG CGACAGTGGG GGCCGCTCCT GCGGCCGGCG
GCACTCGCCG GCCCGGTAGC GGGCCTGCTG GTGGGACTCG CGGCGGCGGC GTCGGCCGGT
TCGCTGGGTG CTGGCCGGCT GGCCGAGGTA GGGCCGGTGC CGTGGCACGT GGCGGCCGTG
GCGACCGCGG TGACCGGGGC GGGTGTGCTG GGTGGCGTGG TCGCGGCCCG TTTCCTGTCC
CGTGCCTGA
 
Protein sequence
MPSVTPDQPG RRARTGARPP GRPGPARRVP PPRAGGPPSR PPLAVAAGAA AVGAALTSWL 
PVAVVLWFFQ LSESAATLLG VVRIGLAGWL LGHGVPLVTD VGPLGLAPLA VTLLAGWRLT
RAGVHVTRAI GARGSRSPKR AVLAAGAVGF AYAVLGVSAA LLVTTGEPAV SPVRAGLTLA
VVGTVAALAG AVRTTGLGDQ FAERAPLPLR EGIRTGLVAA LLLLAAGAGM AGLAVATGGG
DAADLIGKYH TGVAGQAGIT LVNLAYAPNA AVWSTSYLLG PGFAVGTDTT VRTSEVTVGA
LPALPLVAGL PGGPADGLGA GLLAVPVLVG MVAGWLLTRR VLRLVDEGAR RQWGPLLRPA
ALAGPVAGLL VGLAAAASAG SLGAGRLAEV GPVPWHVAAV ATAVTGAGVL GGVVAARFLS
RA
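
The sequences above are printed in space-separated groups of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step; the helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (grouping is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record, as sample input.
first_line = "ATGCCATCCG TCACCCCCGA CCAGCCCGGC CGTCGCGCCC GGACGGGCGC ACGTCCGCCT"
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence block to the same functions gives values that can be compared with the Gene Length and GC content fields of the record.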