Gene Sare_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1687 
Symbol 
ID5705222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1946346 
End bp1947905 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content63% 
IMG OID641271190 
Productglycosyl transferase group 1 
Protein accessionYP_001536565 
Protein GI159037312 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00440693 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCGA CGGGGAAAGG CACATCGACG GCGCCTGCCC GGATCGTGAT ACTCGTCGAC 
AATGGTGTCA CTGGCGACTC CCGGGTGCAG AAGACAGCAC GTTCAGCCGC TGACGCCGGG
TGGGATGTGA CTCTGCTCGG CAGCTCACCC AACGGGCGGC CCCAGCAGTG GAGGCTCGGC
TCCGCTACGG TGCGGCTGCT GCCGATGCCG AGCCCGTTGA GGCAACGTCG GCATGAGCTG
CGTCGGCGAT GGTTGCTCGG CCCCCTGGCC TATCCCCCGA CCGGTGTCGC CGCGCGGCGG
CAGCAGGAGG TTCGTGCCTG GCAGGCCGAT CTGAGGGTCC GTCGGGCGCT GTTGACCTCC
GAGGGGGGCT CCTCTCTGGC TCGCCAATGG CTGCGAGCGC AGGCGCTGGC CGCTCGAGTA
GTGCGGAAGT GGATCTCTTT CCGACATTGG CAGTTGACCA ATGGACAAAA AAAGCGTAAG
CGGTTGACGA CCCCGTCGGA CCGCCTCTTC ACGTGGTTGC AGCTGCGTCT GCGGGGCGAC
CGTGCCTGGC GCCGGCTCGA ACCACAGCTG TGGGACTTTG AGCTTGCCTT CGCTCAGGTT
GTTGATCAGC TCAAGCCGGA CATCATCTAT GCCAACGACT TCCGTATGTT GGGTGTCGGC
GCACGTGCCA AGATCAGGGC GGCAGCGGCT GGCCGTGAGA TTAAGTTGAT CTGGGATGTT
CACGAGTATC TTCCTGGCGT GAAGCCACGA GTGGACAACA ACCGGTGGAT GGTTGCCAAT
CAGGCACACG AACGCGAGTA CGCCAGGTGG GCCGATGCGG TGATGACGGT ATCTGACCGA
TTGGCTGAGC TGTTACAACG TGATCATGGA TTGGCCGAGC GGCCGTCGAT CGTACTCAAC
ACGCCGAACG CGGCTGATGC GTTAGGCGCT CACGGCGCTG ATTCCCAGGA TGTGCGCAGT
AAATGTGGAC TCGATCCCGA TGACCCGCTC GTGGTCTACA GTGGGGCGGC GGCAGCGCAC
CGCGGCATGG GTGTGATGGT GGAAGCGTTG CCTCGCCTGT CCGACGCGCA CGTGGCATTC
GTCGTCAATG CCCCAGCCGG GCCCTACATG AAAAGCCTGG TGGCCCGAGC CCGTGAACTC
GGCGTGGCGG ATCGTGTGCA TGTGCTGCCG TACGTCGCGC CGGCGGAGGT GGTTGGTTTC
CTGTCCACCG CGACGTTGGG CGTGATCCCG ATTCACCATT GGCTCAATCA TGAGATCCAA
CTCATCACCA AGTTCTTCGA GTATTCCCAC GCACGGCTGC CGATTGTGGT CAGTGACGTC
GAGACCATGG CGGCCGCTGT ACAGGAAAGC GGGCAGGGTG AAGTCTTCCA GGTTGACGAT
GTGGATGGAT TTGTGATGGC GGTCGAGACG ATCCTGGCGG ATCCCCAGCG GTATCGCAAA
GTATATGACG CGATGGATCT GAGGGTGTGG ACTTGGGAGG AGCAGGCGCG GGTCCAGAAC
AGCATCTACC AGCGATTGGC CCCGCAGGAC CGACACCCTG CCCTGTCGGC GTCCGATTGA
 
Protein sequence
MTATGKGTST APARIVILVD NGVTGDSRVQ KTARSAADAG WDVTLLGSSP NGRPQQWRLG 
SATVRLLPMP SPLRQRRHEL RRRWLLGPLA YPPTGVAARR QQEVRAWQAD LRVRRALLTS
EGGSSLARQW LRAQALAARV VRKWISFRHW QLTNGQKKRK RLTTPSDRLF TWLQLRLRGD
RAWRRLEPQL WDFELAFAQV VDQLKPDIIY ANDFRMLGVG ARAKIRAAAA GREIKLIWDV
HEYLPGVKPR VDNNRWMVAN QAHEREYARW ADAVMTVSDR LAELLQRDHG LAERPSIVLN
TPNAADALGA HGADSQDVRS KCGLDPDDPL VVYSGAAAAH RGMGVMVEAL PRLSDAHVAF
VVNAPAGPYM KSLVARAREL GVADRVHVLP YVAPAEVVGF LSTATLGVIP IHHWLNHEIQ
LITKFFEYSH ARLPIVVSDV ETMAAAVQES GQGEVFQVDD VDGFVMAVET ILADPQRYRK
VYDAMDLRVW TWEEQARVQN SIYQRLAPQD RHPALSASD