Gene Sare_1609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1609 
Symbol 
ID5703464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1839018 
End bp1840325 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content62% 
IMG OID641271118 
Producthypothetical protein 
Protein accessionYP_001536493 
Protein GI159037240 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.44696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0279779 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCC TGGCCCTGAC CACCGCCGGG ATCGCCGCCA GCCCCGCCGC CGCACGCATC 
ACTCCCAGCG CCGAGGACCG GTCCGAGCAA CTCGGTCGAC CCGCCCCCGA CATACACCGC
GGCGAGGGTA AGGGCGACGA GAAGCGCAAG GGTAGGAGCA AACAGGGGAA CGAGAGGTCG
AAGGGCATCC CGGTCGCCTG CGATACGGAC AAACTGATCG CGGCGATCAC CCTCGCCAAC
GCCCGTGGCG GTGCCGTCCT CGACCTCGCC AAAGGCTGTA CCTACCTACT GACCGCCGCC
ATCGACGGCG CCGGATTGCC TGCTGTCACC ACCCCCATCA CCCTCAACGG CGGCAAACAC
ACCACCACCA CCATCACACG CGCCGCCGCA GCCGACCAGT TCAGAATCCT CACCGTCGAC
ACCGGCGGCG ACCTTACCCT CAACCACCTC ACCATCACCG GCGGACAAAC CACCAACGCT
GGCACTGACG GGGCTGGCGT CCTCGTGGAC GCAGGTGGAA CGTTGACCAG CAATCACAGC
GCCATCACCC GCAACATCGC CGGCGGCAGT GGTGGCGGCA TTGCCAACAA TGGCACCACT
CATGTCCACG CCTCCAACGT CAGCCACAAC ACCGCTTCCG CTGCCGCCGG AGGTGTGGCA
AGCTCTGGCG TACTCGAAAT CAGCAAGTCC AGCATCCATG CCAATGCCGC CGTGGACGGA
GGTGGGGTCA CGAGTTCGGG AACTGTGCGG ATCGAACACA GCAGAATTTC CGGTAATCGC
GCCCGCAGCT CTGGCGGTGG CCTTTTTGTC ATGAGGGGTA CTGGCTCTGT CGTGGACAGT
AGCGTTACGA CGAACATCGC CGGTGACGTG GTTGGTGGCA TCATCGCGAG CCTTGGCGTG
CAGATGACGA TCCGGTCCTC TGTCATCGCG GAAAACGTAG CTGAAGCGGG AATTGTCGGT
GGTCTGGGGG TGGACGAAGA CGCCTCGGTT GTCGTTACGG ACAGCATTAT CAAGAACAAC
AACACCAGCA GCAATGGCGG TGGGGTCTAC AATATCGCCG AACTGGTACT CAGGAACACG
AAGGTCTTTG GTAATCAGTC AGGTGATCAG GGCGGCGGAA TATACAACGA AGCCTCCGCA
ACCCTTGCCC TGTTCGGCAC CAAGGTCACC AAAAATATTG CCGTCACCGA CGGTGGTGGC
ATCTTCAATC AAGTGGGCGG CACGGTGGAC CTGAACAACG CTACCGGCAC CGTCGTGGTC
AAGAACCGTC CGAACAACTG CATCAACGTC CCCGACTGCC TCGGGTAG
 
Protein sequence
MTGLALTTAG IAASPAAARI TPSAEDRSEQ LGRPAPDIHR GEGKGDEKRK GRSKQGNERS 
KGIPVACDTD KLIAAITLAN ARGGAVLDLA KGCTYLLTAA IDGAGLPAVT TPITLNGGKH
TTTTITRAAA ADQFRILTVD TGGDLTLNHL TITGGQTTNA GTDGAGVLVD AGGTLTSNHS
AITRNIAGGS GGGIANNGTT HVHASNVSHN TASAAAGGVA SSGVLEISKS SIHANAAVDG
GGVTSSGTVR IEHSRISGNR ARSSGGGLFV MRGTGSVVDS SVTTNIAGDV VGGIIASLGV
QMTIRSSVIA ENVAEAGIVG GLGVDEDASV VVTDSIIKNN NTSSNGGGVY NIAELVLRNT
KVFGNQSGDQ GGGIYNEASA TLALFGTKVT KNIAVTDGGG IFNQVGGTVD LNNATGTVVV
KNRPNNCINV PDCLG