Gene Sare_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1120 
Symbol 
ID5706063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1264465 
End bp1265790 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content64% 
IMG OID641270635 
Productpolymorphic outer membrane protein 
Protein accessionYP_001536019 
Protein GI159036766 
COG category 
COG ID 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00052776 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGGCC TGGCCCTCAC CACCGTCGGC GTCGCCGCCA CCCCGATAGC AGACGCGGTC 
GGACGCGCCA TCAGCAGCGA CGCCGACCGG CCCGGGAAAC CCACGGGCGA CCGATCAGCC
ACCGGGGACA GCCACGACGA CCGAGGCAAG GACGACAAGG GTACGCGCGA CGACAAGAAC
GGGGAAACGA AGAGGAAGCC GAAGGGCATT CCGGTCCCCT GCGACGCGGA CAAACTGATC
GCCGCGATCA CCCTGGCCAA CGCCCGCGGC GGCGCCGTGC TCGACCTCGC CAAGAAATGC
ACCTACCTAC TCACCGCCAA CATCGACGAC GGCAACGGCC TACCCACCAT CACCGCCCCC
ATCACCCTCA ACGGCAGCAA ACACACCACC ATCAAGCGCG CCGCCGGGGT GGAGCAGTTC
CGCATCGTCA CCGTCGGCAC CGGCGGCGAC CTCGCCCTCA ACCACCTCAA AATCACCGGC
GGACAGACCG ACGGCGACGG CGGAGCAATC CTGGTCAACA CCGGCGGACG ACTGAACGCG
AAACACAGCA CCATCACTCG CAACATCGCC AACGGAACCG GCGGTGGCGG CGGTATCGCC
AGCGCCGGCG TAACGACCCT CGAACACACC ACCGTCAGCC GCAATATCAC CAGCAGCTTC
GCTGGCGGAA TCTACAACCC GGGTGGACAA CTGACTGTCA CCAAGTCGCT GGTAAAGGCG
AACACCGCCA ACAGCACCGG TGGGGTGGCA AGTGTCGGCA GCTCCGCGGT GGTGACGATA
ACGAAGAGCG TCATCGCGGA CAACAGTTCC CAGGGCACGG TCGGGGGTTT GCTGATCCTC
AACGATGGGG TCGGCAGAGT CGACGACACC AAGATTTCCG GTAACACCGC AGGCTCGTTC
GGTGCGATCT ACGTCGACGG ACAAATCACT CTACAAAAGG TCGACATCAC GAACAACACC
GCCTCGAGTG GGTTCGCCGG CGGGCTGTTC GTGGGCACTG ACTCCGTCGC TGTGGTCGAC
AAGGGCCTCA TCAAGGGCAA CATCTCCCTC ACGAACTTCG GCGGTGGCGT ATACAGTTTC
AGCGAGCTGG TAATGCGCGA CACGAAGGTC ATCGGCAATC AGGCCGAGCA GGGTGGCGGC
ATCTACAACT CCGGGGGCAC GGTAACGCTG TTCAACACAC AGGTGGTGAA GAACATCGCC
GTGACCGACG GCGGCGGCAT CGTCAACAAT GGCGGGACGG TTGACCTGAA CACCGCCACC
GGCACCATCG TCATCAAGAA CCGGCCCAAC AACTGCGTTG GCAACGTTCC CGACTGCCCG
GCCTGA
 
Protein sequence
MTGLALTTVG VAATPIADAV GRAISSDADR PGKPTGDRSA TGDSHDDRGK DDKGTRDDKN 
GETKRKPKGI PVPCDADKLI AAITLANARG GAVLDLAKKC TYLLTANIDD GNGLPTITAP
ITLNGSKHTT IKRAAGVEQF RIVTVGTGGD LALNHLKITG GQTDGDGGAI LVNTGGRLNA
KHSTITRNIA NGTGGGGGIA SAGVTTLEHT TVSRNITSSF AGGIYNPGGQ LTVTKSLVKA
NTANSTGGVA SVGSSAVVTI TKSVIADNSS QGTVGGLLIL NDGVGRVDDT KISGNTAGSF
GAIYVDGQIT LQKVDITNNT ASSGFAGGLF VGTDSVAVVD KGLIKGNISL TNFGGGVYSF
SELVMRDTKV IGNQAEQGGG IYNSGGTVTL FNTQVVKNIA VTDGGGIVNN GGTVDLNTAT
GTIVIKNRPN NCVGNVPDCP A