Gene Sare_2950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2950 
Symbol 
ID5707804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3346085 
End bp3347305 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content69% 
IMG OID641272399 
Producthypothetical protein 
Protein accessionYP_001537767 
Protein GI159038514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0162357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACGC AGGCGAACCC AGCGACCACC AGCTGCACCG CTACTGCCGA GTCGGTGATG 
ATCCGTGACT CGATGAGTGC CGTCCCGCGG ATCTTCGCGA TGCCGCTGAC CTTCCTCACC
GGCAAACCGG CAACGGGTCA GCGCCCACTG CGGCTGACCC CCGGCATCCA CCTGGCGGCC
GCGACCGTGT CCGTGCTGAT CGGCCTGAGC CTGAGCTGGG TGGCGATGGC CGGTGGCGGG
TGGTGGCTGC TGCTGATCGT GGGCTGGGCG ATGACGCTGC ACGGTGCCCG TAACGCGCGG
ATGATGGTGT ACCACCAGGC GGCGCACCGG AACATGTGGG CCCGGCCGCG CCGGGACCAG
GTCGTCGGTC GGATCGTGGC CGGGGTGCTG CTGGTGCAGG ACTTCAGCCG GTACAGCACC
GAGCACGTCC TCGACCACCA CGCCGTCCAC CACATGACCG TGCGGGATCC CACGGTGCAG
GCATTTCTCA TCGGGCTGGG GCTCCGTCCC GGGATGACCC GACGGGAGAT GTGGCGGCGC
CTGATCGTCC ACAAGCTGCT CTCACCCACG TTCCACCTGA GCTTCCTGAT CGGCCGAATC
CAGTCGTACT TCGCGCCGGC CAGCTGGCGG CAGCGACTGC TCACCCTGAC GGTCTACGGT
GCCGTGATCG CGCTCGCCGT ACGCTTCGAC GCCTGGGTCT TCCTACTGGT CGCCTGGGTG
TTGCCGATGA CCTTCTTCTA CCAGGTCAGC AACACGCTGC GGCTGTGCGT CAAGCACACC
TTCCCGTCCC CCGCGGCCAC CGAGCGGCGC GGGCGCGGGT ACTTCGCCAG CCTCACCAAC
GCGATCCTGA TCGGCGAGCG GGCCCCGGAC CGCGAGGTCA GCGGGCGGCT GCGTCGCCTG
CGCGGCTGGG CCCGGTGGTG GCTGCGCATG CTGACCGTCC ACCTGCCGGT CCGCTATCTG
GTCCTGACGG GCGACACCGT GGTGCACGAC TTCCACCACC GCCACCCGAT GAGCCGGGAG
TGGGCGGACT ACATCTTCGC CCGGCAGGCG GACATCGACG CCGGGCACCG CGGCTGGCCG
CCGTACCGGG AGATCTGGGG CCTGGTGCCC GCGATCAACC TGGTGTTCGA GTCGCTGTCG
CGGGCCGACC CGCAGGAGTA CGACCGGGCA CGCATCCCAG AGGTCAGCGG ACGCAGCGTC
TTCTCCGCCT TCGACGACTG A
 
Protein sequence
MVTQANPATT SCTATAESVM IRDSMSAVPR IFAMPLTFLT GKPATGQRPL RLTPGIHLAA 
ATVSVLIGLS LSWVAMAGGG WWLLLIVGWA MTLHGARNAR MMVYHQAAHR NMWARPRRDQ
VVGRIVAGVL LVQDFSRYST EHVLDHHAVH HMTVRDPTVQ AFLIGLGLRP GMTRREMWRR
LIVHKLLSPT FHLSFLIGRI QSYFAPASWR QRLLTLTVYG AVIALAVRFD AWVFLLVAWV
LPMTFFYQVS NTLRLCVKHT FPSPAATERR GRGYFASLTN AILIGERAPD REVSGRLRRL
RGWARWWLRM LTVHLPVRYL VLTGDTVVHD FHHRHPMSRE WADYIFARQA DIDAGHRGWP
PYREIWGLVP AINLVFESLS RADPQEYDRA RIPEVSGRSV FSAFDD