Gene Sare_3646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3646 
Symbol 
ID5703340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4206889 
End bp4208418 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content68% 
IMG OID641273071 
Productpolymorphic outer membrane protein 
Protein accessionYP_001538435 
Protein GI159039182 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.749989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00730149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGGC CGGACGGCCG CCACCGCTTC GTCAGGAGCG CGGACCGACC GACACGAAGG 
TTGACCATGA TGCCGGACCA CGCACGGCCA CCAGGAACAC CCCGCGGGCG GCGCCTCGCC
CGGACAGCCA CAACACTGGG CGTGGCGACT CTCGCCAGCC TGACGACCGT GGCCGCCACC
GGAACCATGA GCCTCGCCCA CACATCAGGG GCGCTTGCTC GAGCTTCCGT GTCCGAGGGC
GACGGCGGCG CGCCAAACCC GGGCAAGGGT GGCCCGGAAC GGCACCGGGA CAGCTCCGCC
GAGGGCGACC GGGGTGGCGA GAAGAGCGCC GCCGGTGATC CGGGCTGGCA CCGCGACGAC
CACGGTGACC ACAAGGCCAA GGGCACCCCC GTGCCCTGTG ACCCCAACGC GCTCATCGCC
GCGATCACCG CCGCGAACCA GGCCGGCGGA GGCACCCTGC GCCTGGCCGA GAAGTGCCGC
TACACCCTGA CCGTCAACCA GGACGACAAC GGACTGCCCC CCATCTTCCA ACCCATCACG
ATCCACGGCC AGGGCGCCAC CATCATCCGC GCCGCCGCCG CCGACAACTT CCGCATTTTC
AACGTCACCA CCGGCGGCGA CCTCACCCTC AAAGACCTCA CCGTCGCCGG CGGACGAGTC
GAGGGCGCCA ACACCGACGG CGGCGGGATC TTCGCCGGTG AGGGCACCAG GCTGACCCTT
GAACGTGTCA CCGTCCGCAA CAACATGGCC CTCGGCAACA ACGGTGACGG CGGCGGCATC
TCCGGCGACC GCAGCAAGGT CACGATCACC AAGAGCACCA TCACCGACAA CACCGCTGCC
GACGAAGGCG GTGGCGTCTC CAGTGACAAC GGTGCCTTCA CCATCAGCAA GTCGAAACTG
ACCCACAACA CCGCCGGCAC CACTGGCGGC GGATTCTCCA ACGACGACGG CACCGCCACC
ATCAGCCACA CCGTGATCAG CGACAACAGC GCCACCGACG GCGGCGGCGT GGACAACGAC
GGCGATGTGA CCGACATTGT CCACAGCACC ATCACCCGCA ACACCGCCAG CGCACTCGGC
GGCGGCATCT ACGACAACAG CAACGACAGT CTGCTGCTGC GGCACACCAC CGTCGCCAAG
AACACCGCCA CCTCCGGCGG CGGGATCTAC ACCACCGGCA GCATCGGCGC CACCATCGAG
GACAGCAAGA TCGTGCACAA CATCGCCACC ACCGGCGACG GCGGCGGCAT TTCCATCAAC
GGCCCGAACG GCCCGGTAGT GGCCCTGCGT CGGAGCACCG TCTCCGACAA CCAGGCCACC
GGCCGCGCCG GCGGCATCTT CTTCAACCCA CCCGACGACA CCCTGCTGAC CCTCACCGAC
GTCCGCGTCA CCAAGAACCT CGCCCAACTC GAACCCGGCG GTATCTACAA CAACGGCACC
GTCATCGTCC TGGGCAAGAC CACCATCATC GACAACCGAC CCACCAACTG CGTCGGCAGC
CCCAACCCCG TACCCACCTG CTTCGGCTGA
 
Protein sequence
MTRPDGRHRF VRSADRPTRR LTMMPDHARP PGTPRGRRLA RTATTLGVAT LASLTTVAAT 
GTMSLAHTSG ALARASVSEG DGGAPNPGKG GPERHRDSSA EGDRGGEKSA AGDPGWHRDD
HGDHKAKGTP VPCDPNALIA AITAANQAGG GTLRLAEKCR YTLTVNQDDN GLPPIFQPIT
IHGQGATIIR AAAADNFRIF NVTTGGDLTL KDLTVAGGRV EGANTDGGGI FAGEGTRLTL
ERVTVRNNMA LGNNGDGGGI SGDRSKVTIT KSTITDNTAA DEGGGVSSDN GAFTISKSKL
THNTAGTTGG GFSNDDGTAT ISHTVISDNS ATDGGGVDND GDVTDIVHST ITRNTASALG
GGIYDNSNDS LLLRHTTVAK NTATSGGGIY TTGSIGATIE DSKIVHNIAT TGDGGGISIN
GPNGPVVALR RSTVSDNQAT GRAGGIFFNP PDDTLLTLTD VRVTKNLAQL EPGGIYNNGT
VIVLGKTTII DNRPTNCVGS PNPVPTCFG