Gene Sare_2778 details


Gene Information

Locus tag: Sare_2778
Symbol:
ID: 5706170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3159586
End bp: 3160587
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 66%
IMG OID: 641272234
Product: glycosyl transferase family protein
Protein accession: YP_001537604
Protein GI: 159038351
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
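
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is only an illustrative sketch: the function names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3159586, 3160587)
    print(length)                     # 1002
    print(protein_length_aa(length))  # 333
```

With the values from this record, the sketch gives 1002 bp and 333 aa, matching the Gene Length and Protein Length fields.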


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.2611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00237783
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTCCTGT CGGTAGTGGT GCCCTGCTTC AACGAGGAGG CCTCGGTCGA GCAGCTGCAC 
ACCGCGGTCA CCGCCGCGGT CGCCGAGCTC TCCGATGTGG AGATCGAGGT GGTCTATGTC
GACGACGGCA GTGTCGACGG CACCCTCGCG GCACTGCGGC GACTCGCCGC CATCGACCCG
GCGGTGCGAT ACACCTCACT GAGCCGCAAC TTCGGCAAGG AGGCGGCGAT GCTGGCCGGC
CTGAAGCGGG CCACCGGGGA CGCCGTCGTG ATCATGGATG CGGACCTGCA ACACCCACCA
CGGCTGCTAC CGGACATGGT GGCGTTGTTC CGGCAGGGTT TCGACCAGGT GATCGCCCGC
CGCGACCGAC GCGGGGACCG GTTCCTGCGC ATGGTGGCCT CGCGGTCCTT CTACCGGATG
GTGAACTGGT GGATCGACGT GCGGCTGTTG GATGGGGCCG GCGACTTCCG GTTGCTGTCC
CGACTCGCTG TGGACGCGGT GCTGGCCATG CCGGAGTACA ACCGCTTTTC CAAGGGTTTG
TTCTCCTGGA TCGGATTCCG GACCGTCGTG ATAACCCACC GCAACGAAAC CCGACGGACG
GGCCGGAGCA GGTGGACGTT CGGCAACCTG TTCAACTACG CGTTCGACGG GCTGCTGTCG
TTCAACAACC GGCCCCTCCG GCTGGCCATC TACGGCGGCC TGTTGCTCAC CCTGATCGCG
CTGGGGTACA TGATCTGGGT GGTCGGGGAT GCCCTCAGCA AGGGGATCGA CGTACCCGGT
TACACCACCA TCATCGTCAG TGTCATCGGT CTGGGCGGTA TCCAGATGGT GCTCCTCGGA
GTGATCGGGG AGTACATCGG CCGGATCTAC TACGAGACCA AACGCCGGCC GCACTATCTG
GTGCAGGAGA CGGATGACCC GGCCCCGGAC CCCCGGACGC CCCGCCCACG ACCGACCCCG
CCGCCGGTCG ACGGCCGAGC CCGTCACCAC CGAGACCGAT AG
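
The gene sequence is shown in blocks of 10 bases, so it needs to be joined before its length or GC content can be checked against the fields in the record. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used here only as a short sample.
    sample = "ATGCTCCTGT CGGTAGTGGT GCCCTGCTTC AACGAGGAGG CCTCGGTCGA GCAGCTGCAC"
    seq = clean_sequence(sample)
    print(len(seq), seq[:3])
    print(f"GC fraction: {gc_fraction(seq):.2f}")
```

Pasting the full gene sequence into `sample` would allow the length and GC content to be compared with the Gene Length and GC content fields of the record.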
 
Protein sequence
MLLSVVVPCF NEEASVEQLH TAVTAAVAEL SDVEIEVVYV DDGSVDGTLA ALRRLAAIDP 
AVRYTSLSRN FGKEAAMLAG LKRATGDAVV IMDADLQHPP RLLPDMVALF RQGFDQVIAR
RDRRGDRFLR MVASRSFYRM VNWWIDVRLL DGAGDFRLLS RLAVDAVLAM PEYNRFSKGL
FSWIGFRTVV ITHRNETRRT GRSRWTFGNL FNYAFDGLLS FNNRPLRLAI YGGLLLTLIA
LGYMIWVVGD ALSKGIDVPG YTTIIVSVIG LGGIQMVLLG VIGEYIGRIY YETKRRPHYL
VQETDDPAPD PRTPRPRPTP PPVDGRARHH RDR