Gene Sare_1827 details


Gene Information

Locus tag: Sare_1827
Symbol:
ID: 5704632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2106572
End bp: 2108140
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 74%
IMG OID: 641271329
Product: hypothetical protein
Protein accession: YP_001536704
Protein GI: 159037451
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
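
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes inclusive coordinates and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2106572, 2108140)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```

Both results agree with the lengths listed in the record.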


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.356009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0154869
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCG TGACGCACCG CCCGGTGTCG GGCGCCGCTG GGAGGCCGGA CGAGGCGGAG 
CACCGGGGTA GTGCGCGCAG CGGTGCGATC GGGCTGATCG GCGCCGCGAC CAGCGGACTG
TTCGGTTTCG TCCTGGCGGT GGTCATCACC CGGGGCTACG GCACCACCGG TTCCGGTGCG
TTCTTCGCCG CGATCGGGGT GGTTACCGTG GCCACCGCCG TGTGCACACT GGGCGCCGAG
ACCGGTCTGA TGTGGGCACT GCCGCGTCGC CGCGTCGGGA ACGCCGCCCG GGTGTTGCCG
GTGGCACTGC TCCCACCGTT CGCGGTCGCC GTGGCCGTCG CCGTGGCCGG CGTGCTCGCC
GCCGGCTCGC TGGCACCCCG GGTACTCGGC ACCGCCGGGG GGGCGTCGCT GCTGGCGGTG
AGCTTCGCCG CCGTGCCGGT GGTCGTCATC CTGACGCTGC TGCTCGCCGC CCTGCGCTGT
GTCCGGCCGA TTCGGGCGTA CGTGTCGGTG CAGTTCTTTC TCCTCCCGGT GGCCCGACCG
GTGCTGGTCG GCGCCGCCGT CCTGGTCGGC GGTGGCCTGG TCGCCGGCGT GACCGCCTGG
CTGGTTCCGG CCGCACTGGC CCTGCTGGTC TGCCTGGCCC TGGTGGCGGG GCCGTTGCAC
ATCGGGCACG GCGCCGGGTT GCGCCCCGAG CGGCGGGACT GGTCGACCTT CTGGCGGTTC
GCCCTGCCCC GGGCTGCCTC GGCCGCCATC GACGCCGGCA ACCTGTGGGT CGGGGTGCTG
CTGACGTCGA CGCTCGCCGG GGCGAACGAG GCCGGCGTGT TCGGTGCCGT CGGCCGGTAC
GTTCTCGCCG GCCAGCTCGC CATGCAGGGG CTTCGGGTGG CGGTGTCCCC GCAGCTGTCC
CGGCTGCTCG GCGAGGGCCG GCCGCACCTC GCGGCGGCCG TGCACCGGCA GCTGACCACG
TGGGGGCTGG TGCTGTCCTG GCCGGTCTAC CTCCTGCTCG CCGTTTTCGG GCTGGCCTTC
CTCGAACTGT TCGGGCCCGG CTTCACCGCC GGAGCCACCG CGATGACCAT CCTCGCGCTG
GCGATGCTGG TGAACACGGG GGTGGGCAAC GTGCAGAGCC TGCTGCTGAT GAGCGGCCGC
AGCGGGCTGC ACCTGGCCGC CACCCTGGCC GGGCTGCTGG TGACCGTCTC GCTCGGCCTG
GTGCTGATCC CCGGCCACGG CGCCACCGGG GCGGCACTGG CCTGGGCCGC GGGCATCGTC
ACCGAAAATC TCACCGCCGA GACGTGCGCG TGGTTCGTGG TGCGGCATCC ACTGGTGGAC
GGGGCGATGG TGCGGGCGGC GGCAGCGACC GTGACCGGGG TGGGTGCTGT CGCCGGGGTG
GGTGTGCTGG TGGGCGGACG GGGGATCACC GGCCTGCTGG TGGCGGTGGC TGGGCTGGCC
GTCGGCTGCG TCGGCTTGTT GACGGTGCCC CGAGTGCGGC GGGCCATCAG AGCGACCGTG
CGGCAGGTCC GTGGGCGGGA GGCGACTGTG CCGGCCACCC TCGGAGAACC AGAACAGAAG
GACAGGTGA
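
A GC content figure like the one in the record can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper, and the spaces and line breaks used in the listing are stripped before counting.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace in the listing is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above.
print(gc_fraction("ATGACCGCCG"))  # 0.7
```

Passing the full gene sequence as one string gives the value for the whole gene.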
 
Protein sequence
MTAVTHRPVS GAAGRPDEAE HRGSARSGAI GLIGAATSGL FGFVLAVVIT RGYGTTGSGA 
FFAAIGVVTV ATAVCTLGAE TGLMWALPRR RVGNAARVLP VALLPPFAVA VAVAVAGVLA
AGSLAPRVLG TAGGASLLAV SFAAVPVVVI LTLLLAALRC VRPIRAYVSV QFFLLPVARP
VLVGAAVLVG GGLVAGVTAW LVPAALALLV CLALVAGPLH IGHGAGLRPE RRDWSTFWRF
ALPRAASAAI DAGNLWVGVL LTSTLAGANE AGVFGAVGRY VLAGQLAMQG LRVAVSPQLS
RLLGEGRPHL AAAVHRQLTT WGLVLSWPVY LLLAVFGLAF LELFGPGFTA GATAMTILAL
AMLVNTGVGN VQSLLLMSGR SGLHLAATLA GLLVTVSLGL VLIPGHGATG AALAWAAGIV
TENLTAETCA WFVVRHPLVD GAMVRAAAAT VTGVGAVAGV GVLVGGRGIT GLLVAVAGLA
VGCVGLLTVP RVRRAIRATV RQVRGREATV PATLGEPEQK DR