Gene Sare_1617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1617 
Symbol 
ID5703398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1848765 
End bp1850282 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content64% 
IMG OID641271125 
Productpolymorphic membrane protein 
Protein accessionYP_001536500 
Protein GI159037247 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.926452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.174352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATC AGGATCACAA CCGCGGACAC GAGTGGGATC GTTGTGGTGG TGGCCTTTCG 
CGGCCACGGT GGCGGTGGTG GGTCGTCGGG TTGGCGGGGA TGACCGGCCT GGCGCTGACT
GTCGTTGGTG GTGTGGCGAC GCCGGCCGTC AATGCGGGCG GACGTGTCCT CACCAGCGTC
GAGGAGCAGC CTGGGGAGTC CGCCGGGGAT CGGCCCGGGG GTGGGCACCG CGACGACGGC
GAGTCCACCC GGGGAAAGGC CACACGTGAG CAGTCCCCGC GTGAGGACAA GGGTCAGGGT
AACGCGGCTG AGGGCAGAAG CAGCGGCGGC GACGAGGGCA AGGGCGTTCA GGGCAAGGGC
AAGGGTGTTG TCGAGGGCAT GGGTGAGGCC AGGGGGAGGA AGGGGGGTAA GCGGCAGGGT
ACGCCGGTCG CGTGCGACGT GGACAGGTTG ATCGCTGCGA TCAGTGCGGG TAATGCTCGT
GGTGGTGCCG TGCTCGACCT GGCCAAGGGT TGTACCTATC TGCTGACTGC TGATCTCGGG
GGTGCTGGTC TGCCCGTGAT CACCGCCCCG ATCACGCTCA ATGGTGGTAA GCACACGACT
CTCAAGCGTG CCGCTGGGGT TGACCAGTTC CGGATCCTCA CCGTCGACAC CGGCGGTGAG
CTCACCCTCA ACCACTTGAC TGTCACCGGC GGGCAGACCA TCGAGAACGG CGGGGCGATC
TTTGTCAACG CCGGTGGTGC TCTCACCCTC GATCACACCA AGATCGTGCG CAACATCAGC
GCACAGGGAG GCGGCGGCAT CCGGAACGCG GGAACCGTCA CCATCAAACA CTCCCGCATC
GAACGGAACA TCGCCAATGA GAACGGGGGT GGGATCCTCA GCACGGGCGT GCTCCGGATG
CACTCCTCCG CTGTGGACGG CAACGTCGCC ACCAACGGTG GTGGCATCTC CAGCACCGGC
ACCGTTACTG CTGCCCGCAG TGAAATCACC GGAAATCGAG CAACATCCGT GGTCGGCGGC
GGGTTGCTCG TCACCGGAGG AACCGCATCG GTCACCGATT CTCGGGTGGT CCGCAATTCC
TCCAACACTG AGGGCGGCGG GGTTGTGGCC ATTACCTCGC AGATCACCCT GCACCGTGTC
GTCATCGCCG ACAACGCCGC CCGCAGTGGC GGTGGTGGGT TGATTGTCGG TGCCTCCTCA
GCCGTCGTCC GGAAGGGTGT TATCAAGGGC AACACCGCCA ACACCGATGG CGGAGGTGTA
TTCAACGCTG GTGAGTTGTC ACTGTTCGAC ACCCACGTCC ACAGCAACCG TGCCGGTCGT
GGAGCGGGCA TCTACAATAA CGCACTCGGT GCCCTCATCC TGTACGACAG CACGATTAAG
AAGAATATCG CTGTCGCCGA GGGTGGTGGG ATCTTCAACG AGGCGGCTGG AACGGTCGAT
CTGAACACAG CTACTGGCAC AATTGTCATC AAGAACCGGC CCGACAACTG CGTCAACGTT
GCCGGCTGCC CGGGCTGA
 
Protein sequence
MSYQDHNRGH EWDRCGGGLS RPRWRWWVVG LAGMTGLALT VVGGVATPAV NAGGRVLTSV 
EEQPGESAGD RPGGGHRDDG ESTRGKATRE QSPREDKGQG NAAEGRSSGG DEGKGVQGKG
KGVVEGMGEA RGRKGGKRQG TPVACDVDRL IAAISAGNAR GGAVLDLAKG CTYLLTADLG
GAGLPVITAP ITLNGGKHTT LKRAAGVDQF RILTVDTGGE LTLNHLTVTG GQTIENGGAI
FVNAGGALTL DHTKIVRNIS AQGGGGIRNA GTVTIKHSRI ERNIANENGG GILSTGVLRM
HSSAVDGNVA TNGGGISSTG TVTAARSEIT GNRATSVVGG GLLVTGGTAS VTDSRVVRNS
SNTEGGGVVA ITSQITLHRV VIADNAARSG GGGLIVGASS AVVRKGVIKG NTANTDGGGV
FNAGELSLFD THVHSNRAGR GAGIYNNALG ALILYDSTIK KNIAVAEGGG IFNEAAGTVD
LNTATGTIVI KNRPDNCVNV AGCPG