Gene Sare_1605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1605 
Symbol 
ID5703460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1834885 
End bp1836231 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content64% 
IMG OID641271114 
Productpolymorphic outer membrane protein 
Protein accessionYP_001536489 
Protein GI159037236 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.7904 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0102473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCATT CGCACCACAC CCACGAATCC AGCCGGCCCG GCGGACGCCG GGTGAGGTCA 
CGGTGGTGGG CTGTCGGGCT GGCCGGGATG ACCGGCTTGG CCCTGACCAC TGTCGGCGTT
GCCGCTCCGG CCGCCGGGAG TTGGCCCGGG ACACTCGACC GGGCCTCCAC CGACAACCAC
CGCGGCGAGG GCAGCGACAA GAGCAAGGGG CAGCAGGAGT CGAAGGGCGT ACCGGTTCCC
TGCGACCCGC ACCGGCTGAT CGCGGCGATA GCCCTGGCCA ACGCCCGTGG CGGCGCCGTG
CTCGACCTCG CCACGAAGTG CACCTACACG CTGACCGCCG ACATCGACGG TGCCGGCCTG
CCCGCGATCA GCACCCCGAT CACCCTCAAT GGCGGCAAAC ACACCACCAT CACCCGCGCC
GCCGCCGCCG ACGAATTCCG AATCCTCACC GTCAGCGCCA ACGGTCGACT CACCCTCAAC
GACGTGACCA TCACCGGCGG AGCCTCCGGC AACAGCGGCG GTGGAATTCT CATCGACTCG
GGTGGCGCCG CCGCGATCAA CAGCAGTAAG ATCATCAGAA ATGTTGCCAA TTCCGGTAGT
TCCGGCGGCG GGATCGCGAA CATCGGCGGC GCCCTCGACA TCAAGAACTC AAGCATTGTT
CACAACGTCG CCGCCGGTCT TGGCGGTGGT ATTTTTAGCG TCGGTACGCT GTCCTTGTAC
AAGTCACGAA TCGATACGAA CACCGGCGGT AGCGGTGGGG GTGGCCTCTC CATCAGCGGC
AGCTTCAGTA TTTCGCGAAG CGAAATAGCC GAGAATGAGA CTCCTACGGG GGGCGGCGGC
ATCGTCATCC AAGGTGGCGG GTTTGGAAAG ATCACCGATA CGCGTATCGT GAAGAATGTT
GCGACGGGTG GTCAAGGTGG GGCTGCAATA TTCGGTGCAC CCGGAAAGCT CACCATTTCC
CGATCCGTCA TTGCCGACAA CACCGCTACC AGCGGCCAGG GCGGCGCTCT GTTCTTGGCG
TTCGGGGCGA TGCTCGTCGA GGACAGTGTC ATCAAGAACA ACATCGGCAT CAACGGCGGC
GGCATTCGCA ACCAGGGCGC GACGACGCTG CTGCGGACAA AGGTCACCGG CAATCAGGCC
ACCGGTTTGG GCGGCGGCAT CTTCAACACA GACACCGGCA CGCTGTCCCT CTTCACGACG
AAGATCCTCA AGAACATTTC CGTCAACCCT GGCGGCGGCA TCTTCAACCA GGCGGGCGGC
ACGGTGAACC TGAACACCGC CACCGGAACA ACTGTGGTCA AGAATCGGCC GAACAACTGC
GTCAACGTCA CCGGCTGCCC GGGCTGA
 
Protein sequence
MDHSHHTHES SRPGGRRVRS RWWAVGLAGM TGLALTTVGV AAPAAGSWPG TLDRASTDNH 
RGEGSDKSKG QQESKGVPVP CDPHRLIAAI ALANARGGAV LDLATKCTYT LTADIDGAGL
PAISTPITLN GGKHTTITRA AAADEFRILT VSANGRLTLN DVTITGGASG NSGGGILIDS
GGAAAINSSK IIRNVANSGS SGGGIANIGG ALDIKNSSIV HNVAAGLGGG IFSVGTLSLY
KSRIDTNTGG SGGGGLSISG SFSISRSEIA ENETPTGGGG IVIQGGGFGK ITDTRIVKNV
ATGGQGGAAI FGAPGKLTIS RSVIADNTAT SGQGGALFLA FGAMLVEDSV IKNNIGINGG
GIRNQGATTL LRTKVTGNQA TGLGGGIFNT DTGTLSLFTT KILKNISVNP GGGIFNQAGG
TVNLNTATGT TVVKNRPNNC VNVTGCPG