Gene Sare_3843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3843 
Symbol 
ID5707921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4375713 
End bp4376774 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content75% 
IMG OID641273265 
Productaminoglycoside phosphotransferase 
Protein accessionYP_001538627 
Protein GI159039374 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0510] Predicted choline kinase involved in LPS biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.451264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000808201 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCACCC GCCGGCCCGG CTGGTCCGAT CTGCCAGTGG GAATGCGGGC GGCTCTCGCC 
GACCGCCTCG GTGCCCCCGT GGTCGCCACC CGCACCGCAA CCGCCGGTTT CACGCGTGGC
TTCGCCGGGG TACTGACCGC CGCCGACGGC AGTCGGGCCT TCGTCAAGGC CGCGCCCCAC
GACTCCCGCC TAGCCAGCTG GTACGGATGG GAGGCGGCGA TCCTCGACCG GCTCCCGTCC
GGCTTGCCGG CGCCCCGCAC CCGCTGGACG CTGGCCGACT CCAGCTGGTT CGCGATCGCC
CTCGACGTGG TCGACGGCTA CCCGCCCCGG CGCCCGTGGG AGCCGTCGGA GCTGGCCAGC
ACCCTGACCG CGTACGCGGG CGTCGCCGCC GCCCTGAACA CCCCGCCGAA CGACCTCGCC
GCGCTCAACC CGCCCCACCT GGCCGACCTG GCCCAGGCCG ACATCCTCCG TTGGGGCGAT
GTGGCGGCGG GCCGGGAGCC CGCTCCGCCG TTCCCCGCCG GACTTGAGCA GCGGCTGCCC
GAGTTGGTCG GGCTCGAGTC CCGACTCCCG GGGTACGTCG CTTCGGCGTC CAGTCTGATC
CACGGCGACC TACGGCCGGA CAACGTGCTG TTCGGCCCGG ACGGGCAGGT GTGGTTCTGC
GACTGGACCT GGCTCTGTCG CGGTCCGGCC TGGTTCGACC TGGTGACGCT GCTCCTCGGC
GGGCACGCCG CCGGCGATCC GGAGGTCACG GTCGGACCAC TGGCCACGAC CGGCGGCACG
GGGTTGCCTG CCGGCCCGAC GAGGGTGTCC GGCCCGGAGG CCGCCGTCGC ACCAGCGCCC
GCCGTGACCA CGGACTGGCT GGACGCCGCC TTCGCGGCGC ATCCAGCCGC CGCCAACGCT
CCCCCAGACG CCCTGGACGT CACGCTGGCG GCGCTGGCCG GCTACTTCCT CACCACCCCG
ACCCTGGTCC CGGACACCGC GACCGAACAG TTCACGGCCC ACCAGCGCCG CAGCGGCGAG
TACGCATTCG CCTGGCTCGC CTGCCGTCAG GGCTGGTCCT GA
 
Protein sequence
MTTRRPGWSD LPVGMRAALA DRLGAPVVAT RTATAGFTRG FAGVLTAADG SRAFVKAAPH 
DSRLASWYGW EAAILDRLPS GLPAPRTRWT LADSSWFAIA LDVVDGYPPR RPWEPSELAS
TLTAYAGVAA ALNTPPNDLA ALNPPHLADL AQADILRWGD VAAGREPAPP FPAGLEQRLP
ELVGLESRLP GYVASASSLI HGDLRPDNVL FGPDGQVWFC DWTWLCRGPA WFDLVTLLLG
GHAAGDPEVT VGPLATTGGT GLPAGPTRVS GPEAAVAPAP AVTTDWLDAA FAAHPAAANA
PPDALDVTLA ALAGYFLTTP TLVPDTATEQ FTAHQRRSGE YAFAWLACRQ GWS