Gene Strop_2119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2119 
Symbol 
ID5058582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2396617 
End bp2397834 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content73% 
IMG OID640474382 
Productglycosyl transferase, group 1 
Protein accessionYP_001158948 
Protein GI145594651 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.111121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0181031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG GGATCCTGGC GTATCACTTC CCACCGGAAC CGGCATTCAT CCCGGGCAGC 
CTCGCGGAGG AACTGGCCCG CCGCGGCCAC GAAGTCCGGG TGCTGACCGG ATTTCCCGAC
TATCCGGGTG GGTACGTCTA CCCGGGCTGG CGGCAGCGTT GGCGCCACCA GACCCGCAGC
GAGCGGCTGA CCGTGCGGCG GGTGCCCCGC TACGTCGGCC GCAGTGGCTC CGAGCGCGGC
CGGATGGCCG GTCACCTCTC CTTCGCGGGC AGTGTGTCGC TGGTCGGCCG GCGGTTCTTC
GCCGGTGTCG ACGCGCTCTA CGTTCATCAG CCGCCGGCCA CCGCCTTCGC CGCGGCCCGC
CTGCTTCGGG CGCTTCGTCG GGTGCCGGCC GTCGTGCACG TTCAGGACGT GTGGGCTGGT
CCGAAGCCGG CGGCCGGCGG GGGTGATCGG TGGGCCGCCC GGCTTGCCGG TGCGATGGCC
GCTACCTACC GCCACGCCGA CCGGATCGTG GTGGCGGCGC CCTCGCTGCG GGACGTTGTG
GTGACCGAGG GAGCCGACCC GGGCCGCGTC GAGGTGGTGC TCAACTGGAC CGACGAGCGG
ATCTTCCAGC CGGCTCCGCC GAGCCCGGCC GCTGGTCAAC TAGTCCGGCG CGACGGCCGC
TGCGTGGTCA TGTACGCCGG CACCATCGGT GCCCGGCAGG GGCTGGATAC GGCGGTGCGG
GCGGCGGCAG CGCTCGACCA CAGGATGGAG CTGGTGCTGG TCGGGTCGGG TGAGCAGGAG
CGGCGGGTGC GGGGGCTCGC CGCCGAGCTG GGCGCCGACA ACGTGCGGTT CGTCGAACGG
CGCTCGCCGT TGGACATGCC GGAGCTGTAC GCGGCTGCCG ACTACCAGTT GGTCATGCTC
CGGGACCTGC CCGAACTACG CAGCACCCTG CCCGGCAAGC TGCCTACCGC CCTGTCGTGC
GGGGCGCCGG TCATCGCCTC GGCCGGCGGC GACACCGCCG AGGTGGTGGA GAGTGCTCGC
GCCGGACTGT CGTGTCCGCC GGAGGAGTGG GAGACCCTTG CCGACCGGTT CTGGTTGGCC
GCCACCATCC CTCCGGCCGC CCGTGCCGAG ATGGGCCGGC GGGGCCGGGA GGCGTACCTG
CGGCAGATGT CGATGCCGGC CGGAGTGGAA CGGATCGAAT GCCTGCTGGA CGAGGCCGCC
AGCGGACGCC GACGATGA
 
Protein sequence
MKIGILAYHF PPEPAFIPGS LAEELARRGH EVRVLTGFPD YPGGYVYPGW RQRWRHQTRS 
ERLTVRRVPR YVGRSGSERG RMAGHLSFAG SVSLVGRRFF AGVDALYVHQ PPATAFAAAR
LLRALRRVPA VVHVQDVWAG PKPAAGGGDR WAARLAGAMA ATYRHADRIV VAAPSLRDVV
VTEGADPGRV EVVLNWTDER IFQPAPPSPA AGQLVRRDGR CVVMYAGTIG ARQGLDTAVR
AAAALDHRME LVLVGSGEQE RRVRGLAAEL GADNVRFVER RSPLDMPELY AAADYQLVML
RDLPELRSTL PGKLPTALSC GAPVIASAGG DTAEVVESAR AGLSCPPEEW ETLADRFWLA
ATIPPAARAE MGRRGREAYL RQMSMPAGVE RIECLLDEAA SGRRR