Gene Strop_2079 details

Gene Information

Locus tag: Strop_2079
Symbol:
ID: 5058542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 2352777
End bp: 2353886
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 69%
IMG OID: 640474342
Product: glycosyl transferase family protein
Protein accession: YP_001158908
Protein GI: 145594611
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
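
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2352777
end_bp = 2353886
gene_length_bp = 1110
protein_length_aa = 369

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be a stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for this record.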


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.377907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0768803
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACAAGG CCCCACCGGA TACCCGGGCC GGCTTTCCGC TGGCCCGTGA TGAAAAGCCG 
TTATCGATAC GTAGGCGGTA CTTAATGTCG CGTACACCTG ATGTCAGCGT GGTCATCCCG
ACGTGCGACC GGCCGGCCTT GGTGACTCGG GCGGTCCAGA GCGCCCTCAA TCAGTCCGTC
ACCACCATCG AGGTCATCGT CGTGGTCGAC GGTGCGGACG CCGGAACGCT CGCCGCGCTC
GCCGCGCTGC GGGACCCGCG CCTACATGTC CTTCCGCTGA CTGAGCGGGC CGGCGCGCCG
AACGCGCGCA ACGTCGGCGT CGCGGCGGCC CGCGCCGAGT GGACGGCGTT CCTTGACGAC
GACGACGAGT GGCTGCCCCA CAAGCTCGAG GTCCAGCTCC GGCTCGCCAG GACCGCCACG
GTACCCGCGC CGATCGTCGC GAGCCGGCTG GTCAACCGCA CCCCCCGAGC CGAGTTCGTC
CTGCCACGGC GCCTCCCGGA GCCGGACGAG CCGATCTGCG AGTACCTGAC CGTACGCCGG
GGCCTCTTTC ACGGCGACGG ATTCATCCAG ACCTCGACGA TCCTGGCTTC GACCGCGTTG
CTGCGACGCG TGCCGTTCAC GGTGGGCCTC CGCCGTCAGC AGGAGCTGGA CTGGACGCTG
CGCGCCCTCG CGCACGACGA CGTACGCCTC GTCATGGCCA CTGAGCCACT GGTGCTCTGG
CACCAGGATG AGGACCGGCC CCGAATCAGC CTCTCCTCCC CGTGGAAGGC ACAGCTCGAC
TGGTTGCGCT CGATCCGCAC CCTGGTGACC CCTCGGGCGT ACGCGGCGAT CGCGCTCAGT
ATCATCGGCT CGATGGCGGC CACCACCCGC GATCCGCACG TGTTTCGCAC TGTTCTTGCC
GATGCTCGGC GACATGGTCG GCCGGGTCTT CTCGACTACC TGACGTACCT GCAGATCTGG
CTTATCCCAC CCCAGCTTCG GCACACTCTG CGCGACCACA TCCTGGCTCG GCGACGGGTG
TCGGCGCCCG CCCAGACCCC AGCCGCCGAT ACCGCGCCCA GACCAGCCGA GCCCAACCGG
ACCGGCGCCG CCGCGTCCCA GAACCCCTGA
 
Protein sequence
MHKAPPDTRA GFPLARDEKP LSIRRRYLMS RTPDVSVVIP TCDRPALVTR AVQSALNQSV 
TTIEVIVVVD GADAGTLAAL AALRDPRLHV LPLTERAGAP NARNVGVAAA RAEWTAFLDD
DDEWLPHKLE VQLRLARTAT VPAPIVASRL VNRTPRAEFV LPRRLPEPDE PICEYLTVRR
GLFHGDGFIQ TSTILASTAL LRRVPFTVGL RRQQELDWTL RALAHDDVRL VMATEPLVLW
HQDEDRPRIS LSSPWKAQLD WLRSIRTLVT PRAYAAIALS IIGSMAATTR DPHVFRTVLA
DARRHGRPGL LDYLTYLQIW LIPPQLRHTL RDHILARRRV SAPAQTPAAD TAPRPAEPNR
TGAAASQNP
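
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is given 5' to 3' on the coding strand, strips the 10-base grouping spaces, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; that is not handled here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove grouping spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
first_30 = "ATGCACAAGG CCCCACCGGA TACCCGGGCC"
print(translate(first_30))  # MHKAPPDTRA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and GC content listed in the record.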