Gene Strop_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2078 
Symbol 
ID5058541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2351640 
End bp2352773 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content73% 
IMG OID640474341 
Productglycosyl transferase, group 1 
Protein accessionYP_001158907 
Protein GI145594610 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.206868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.316316 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCAG CCCAGCCCTC GCCCCACGTC GTGATCTGGC GCAGCCACCT GCTACCGGGC 
TCGGAGACGT TCATTCGGAA CCAGGCCGAC GCGCTGACCA CCTGGCGGCC CAGCTACCTC
GGCGCGGTGC GAGTCCCCTC GGCGTTGTCC CGCGACACCG ACACGGTGGC GTACGGCAGT
GCGCGGCGAG ACCGGCGGGA CCTACTCGCG CTGAAGGTGT CCGGCCGGTC ACCCCGACTC
ACCCACTTGC TGCGGCAGCT ACGCCCGGCG CTGGTACACG CCCACTTCGG CGGGGACGGG
TGGCTGATCA GCCGAACGAC CGCTGAACTC GGCATTCCCC TGGTCATTAC CGTGCACGGT
CAGGATGTCA CCCGCCAGCC CGCGCTCCCC GGCCTACGTG GAGCCCGTCA GCGACGCAAC
CTCCGGGCGG CGTTTGACCG GGCGGCCCTG GTCGTCGCCG TCAGTGGGTT CATCCGGGAC
CGCGCCGTCA GCCTCGGGGC TGACCCGGCG AAGGTCCACG TGCACCACAT CGGCGTACCG
ATCCCGCCCC CGCCCGCGGC GACCAGGCGG GAGTGGGATG TCGCCTTCGT CGGACGACTC
GTCGCGAAGA AGGGGGTCGA CGACCTGGTC GAGGCGCTGG GGCTGCTCCG CCCCCGACGA
CCCCGCGCGC TGTTCATCGG CGATGGGCCA CTCGCGGCAC CGTTGCGGAC GCGCGCCGCC
GAACTCGGCC TCAACGCCAC GTTCTGTGGA TCGCAGCCAC CGGCGGTGGT GCGCCGGCAC
CTGGCCGCCG CGCGGCTGCT GGCCGCACCG TCCCGGACGG CGCCAGACGG CGATTCGGAA
GGGCTGCCCA CCACCATTCT GGAGGCGGCG AGCGCCGGTC TGCCGGTGGT GGCGACGTAC
CACAGCGGCA TCCCGGAAGC GGTCGTCCAC GGCACGACCG GGCTACTCGG CGCCGAGGGC
GACCGCGTGG CGCTGGCGGC GAACATCGGC CGGCTGCTCG ACAACGACAC GCTGCGCGAG
CAGCTCGGTC AGGCGGGCCG CCGACACGTT GTGGAGCACT TCGACCTGCG CCGGCAGACC
CAGCGGCTGG AACAGCTCTA CGCCCAGGTC GCGGGAGCTC CTCCGCCCCC GTGA
 
Protein sequence
MPSAQPSPHV VIWRSHLLPG SETFIRNQAD ALTTWRPSYL GAVRVPSALS RDTDTVAYGS 
ARRDRRDLLA LKVSGRSPRL THLLRQLRPA LVHAHFGGDG WLISRTTAEL GIPLVITVHG
QDVTRQPALP GLRGARQRRN LRAAFDRAAL VVAVSGFIRD RAVSLGADPA KVHVHHIGVP
IPPPPAATRR EWDVAFVGRL VAKKGVDDLV EALGLLRPRR PRALFIGDGP LAAPLRTRAA
ELGLNATFCG SQPPAVVRRH LAAARLLAAP SRTAPDGDSE GLPTTILEAA SAGLPVVATY
HSGIPEAVVH GTTGLLGAEG DRVALAANIG RLLDNDTLRE QLGQAGRRHV VEHFDLRRQT
QRLEQLYAQV AGAPPPP