Gene Sare_1020 details


Gene Information

Locus tag: Sare_1020
Symbol:
ID: 5708181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1141604
End bp: 1142911
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 65%
IMG OID: 641270537
Product: hypothetical protein
Protein accession: YP_001535922
Protein GI: 159036669
COG category:
COG ID:
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
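
The coordinates and lengths listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1141604, 1142911)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```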


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCC TGGCCCTCAC CACCGTCGGC CTGGCCGCCA CCTCGGCCGT CCTACCGGGC 
GGTGACCAGA CCAAAAAGCC CACCAACGAT CGACCCTCCG TGCAGGAACA CCGTACGGAC
GAACGCGGTA AGCGCGATGA CGGCAAATCC GACGACCCAA GCCGGGGCGA CAAGGGCAAG
GACCGGCCGA AGGGCACACC GGTTCCCTGC GACACGGATA CGTTGATCGC CGCGATCACC
CTGGCCAACG CCCGTGGTGG CGCGATTCTC GATCTTGCCG GGAACTGCAC CTACCTGCTC
AGCGCCACCA TCGACGGTGC CGGGCTGCCC GCCATCACCA CCCCCATCAC CCTCAACGGC
GGCAAACACA CCGCCATCGA GCGCGCCGCC GCCGTCGATC TGTTCAGAAT CCTCACGGTC
AACGCTGGCG GTGACCTCAC CCTCAACCAC CTGACCATCG CCGGCGGCCA CACCGCCGCC
GGTACCTCCG GTGGTGGCGT CCTCGTCAAC ATCGGCGGAA CGTTGACCGC CAACGACAGT
GCCATCACCC GCAACATTTC CGGTAACAAC GGCGGCGGCA TCCTCAACGA GGGAACCACC
GTCGTCACGC GCTCCCGAGT CACCCGAAAC ATCGCCGATC AGCTGGGAGG CGGGATCTAC
AGCTCGGGCC GACTGGAAAT CGTCAAGTCG CAGGTCGACG ACAACACCTC CGTGATCGCC
GGCGGGGTGT TGGCCTTTGA CGTCACTATC CGTGGAGGGA GTATTTCCGG AAATCATGGC
ACGACAGTTG GTGGGCTGTT CGTCCAAGGC GGCGTCGGCA CGGTTGTTGG CACCCGGATC
ATCGGGAACA CCGCCAACGT CATTGGTGGC GTCGCGGTCG CGGGTGCCGG GCAGCTGAAG
GTGCGGCACG TCAAGATCGC CGAGAACACG GCCACCGTCG CCGGTGGATT GTTCGTCCAA
GGAGGTGGGT TGGGCGACGA CAGTAGGGCC GTTGTCGAGG ACAGCGCCAT CGAGAGGAAC
ACCGCCACAG CCACTACCGC TGGCGGGGTC CACAACGCAG GGCAGGCGGT GCTGCGGCAC
ACAAAGATCA CCGACAATCA GGCCGACCTG GGCGGCGGGA TCTACAACAC CGACTCCGGC
ACACTCGCTC TCTACTCGAC CAAGGTCGTC AAGAACATCG CCGTCACCGA CGGTGGGGGA
ATCCTGAACG TCGTGGGCGG CTCCGTCGAG TTGAACACCG CCACCGGCAC GGTCGTGGTC
AAGAACCAGC CCAACAACTG CGTCAACGTT CTCGGCTGCC AGGGCTGA
 
Protein sequence
MTGLALTTVG LAATSAVLPG GDQTKKPTND RPSVQEHRTD ERGKRDDGKS DDPSRGDKGK 
DRPKGTPVPC DTDTLIAAIT LANARGGAIL DLAGNCTYLL SATIDGAGLP AITTPITLNG
GKHTAIERAA AVDLFRILTV NAGGDLTLNH LTIAGGHTAA GTSGGGVLVN IGGTLTANDS
AITRNISGNN GGGILNEGTT VVTRSRVTRN IADQLGGGIY SSGRLEIVKS QVDDNTSVIA
GGVLAFDVTI RGGSISGNHG TTVGGLFVQG GVGTVVGTRI IGNTANVIGG VAVAGAGQLK
VRHVKIAENT ATVAGGLFVQ GGGLGDDSRA VVEDSAIERN TATATTAGGV HNAGQAVLRH
TKITDNQADL GGGIYNTDSG TLALYSTKVV KNIAVTDGGG ILNVVGGSVE LNTATGTVVV
KNQPNNCVNV LGCQG
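
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first and last segments of the sequence. The `translate` helper and its `is_start` parameter are hypothetical names introduced here. The codon table is built from the standard NCBI ordering, and the start codon set is the one listed for table 11.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); table 11 uses the
# same assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate DNA; whitespace between 10-base blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        # An initiator codon at the first position is read as methionine.
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: GTG is read as M.
print(translate("GTGACCGGCC TGGCCCTCAC CACCGTCGGC"))  # MTGLALTTVG
# Last 48 bases, in frame; not the start of the gene, so is_start=False.
print(translate("AAGAACCAGC CCAACAACTG CGTCAACGTT CTCGGCTGCC AGGGCTGA",
                is_start=False))  # KNQPNNCVNVLGCQG
```

Passing `is_start=False` keeps an internal fragment that happens to begin with an alternative start codon from being read as methionine.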