Gene Sare_3654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3654 
Symbol 
ID5704618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4216334 
End bp4217722 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content66% 
IMG OID641273079 
Productpolymorphic outer membrane protein 
Protein accessionYP_001538443 
Protein GI159039190 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00892124 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00240143 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCAGC AGGACCATAC CCAGGAGCCC GAACCGGGTC ACCCTGGTCG CGGGCACCGG 
GCTAGGTCAC GGCGGCGGTG GTGGGCCATC GGGCTGGCCG GCGTGACCGG CCTGGCCCTC
ACCACCGTCG GCCTCGCCAC CGCCCCGACC GCCGACGCCG TCGGACGCAC CCTCGCCGCC
GACGGTCGCC TTGAGAAACC AGGCCAGTCT CATGGGGGCG ACCACCGCGA CGACAACCGC
GGCGGGCATG ACAACAGAAG CAGGAAGAAA CCGAAGGGCG TTCCGGTGCC CTGTGACGCG
GACAAACTCA TCGCCGCGAT CACCCTGGCC AACGCCCGCG GCGGCGCCGT GCTCGACCTC
GCCAAGAAAT GCACCTACCT GCTCACCGCC AACATCGACG ACGGCAACGG CCTACCCACC
ATCACCGCCC CCATCACCCT CAACGGCGGC AAACACACCA CCATCACACG CGCCGCCGGG
GTGGACCAGT TCCGCATCGT CACCGTCGGC ACCGGCGGCG ACCTCACCCT CAACCACCTC
AAAATCACCG GCGGACAGAC CGACGGCGAC GGCGGAGCAA TCCTGGTCAA CCCCGGCGGA
ACACTCCACC TCCACCACAG CACCGTCACC CGCAACATCA CCGGTGGAAG CGGCGGCGGC
ATCGCCAACA ACGGCACCAC CCGGATCAAG GACTCCACCG TCAGCCGCAA CACCTCGGGC
ACTAACGGCG GCGGCATCTG GAGCACCGGC CTGCTCACCG TCACCGCATC CCACATCACC
GCCAACGTCA GCGGCGCCTC TGGCGGAGGC ATCCACAGCG CTCAGGGCAC GTTGCTGGTC
GACCACAGCC GAATCACCGC TAATCACAGT CAGACCAGCG TTGGCGGCCT GGAGCTGTTT
GGTGGAGCCG GAACGGTCAC AAAGACGCAG GTCACCGGCA ACACCGCTCC TGCCGTTGGT
GGCGTGTTGG CCAACTCCGG GCAACTCACC CTCACAAAAG TCGTCATCGC CCACAACACC
GCAACTACGG GACTGTCCGG CGGACTGTCG GTGAACCCCA ACACGCTCAC GGTTGTCGAG
GACAGCATCA TCGACAACAA CAGCGCCGCC GCGAACGGTG GCGGAATCTT CAACTTCGGC
TCGCTGGTCC TGCGGAACAC GAAGGTCACC AGGAACAGAT CAGGAAACCA GGGCGGTGGA
ATCTACAACA TCTTCAGCGG AAACCTCTCC CTCTTCAACA CCAAGATCGT CAAGAACCTC
GCTGTCTCCG AAGGTGGCGG CATCTACAAC AACGTCCCCG GCGTGGTGAA CCTGAACACC
GCCACCGGCA CCATCGTGAC CAAGAACCGA CCCGACAACT GCGTCGACGT TCCGGGCTGC
CCTGGCTAG
 
Protein sequence
MNQQDHTQEP EPGHPGRGHR ARSRRRWWAI GLAGVTGLAL TTVGLATAPT ADAVGRTLAA 
DGRLEKPGQS HGGDHRDDNR GGHDNRSRKK PKGVPVPCDA DKLIAAITLA NARGGAVLDL
AKKCTYLLTA NIDDGNGLPT ITAPITLNGG KHTTITRAAG VDQFRIVTVG TGGDLTLNHL
KITGGQTDGD GGAILVNPGG TLHLHHSTVT RNITGGSGGG IANNGTTRIK DSTVSRNTSG
TNGGGIWSTG LLTVTASHIT ANVSGASGGG IHSAQGTLLV DHSRITANHS QTSVGGLELF
GGAGTVTKTQ VTGNTAPAVG GVLANSGQLT LTKVVIAHNT ATTGLSGGLS VNPNTLTVVE
DSIIDNNSAA ANGGGIFNFG SLVLRNTKVT RNRSGNQGGG IYNIFSGNLS LFNTKIVKNL
AVSEGGGIYN NVPGVVNLNT ATGTIVTKNR PDNCVDVPGC PG