Gene Strop_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3430 
Symbol 
ID5059899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3937947 
End bp3939701 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content73% 
IMG OID640475679 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001160239 
Protein GI145595942 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0765311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCCCA CCCCCGCCAC CTCGCCGCAG CAGCCGGAGC ACCGGCCCGC ACCCGCGACC 
TCGGGCGCGA CGGGCACCGT GCCGGCCACC GACGGCGGCA CGAGCATCCC GACGACGTCC
CGCCCAGCGA CCGCCCAGCC GGCAGCGGGC GAGCTGGCCC GGCTGGCGGC CCACGAAGCA
GGCACCGGAC TCGCCCCCGC CCCGGTCCGG CTGGGCGACG TGGTGCCCGC ACCCGAACAG
GTGCGACCGG ATCCCCGCGC CGACTTCACA CTGTCGGCCG ACACGACAGT CCGGGTCAGC
CCCGAACCCG ACGCGCGGGC CGTCGCCGAA CGCCTCGCCG ACCTGCTCCG GCCGGCCACC
GGATATCGGC TCCCGATCAC CGAAGCGCAG CACCCCGAGC TGGCCGACGA CATCGCGCTC
GTCCTCGTGC AGCAGCCCGA CCTCGGCCCC GAGGGCTACC GACTCGACGT CACGCCGACC
GGCGTCCGGA TCAGTGCCGC CACGGCGGCC GGGCTCCACC ACGGCATCCA AACCCTGCGC
CAGCTCCTCC CGGCCACGAT CGAGAGCAGC ACTCCGGTCC GCGCCGCCTG GACGCTGCCC
GGTGGATCGA TCACCGACCG GCCACGCTTT CCGTACCGGG GCGCCATGCT CGACGTGGCC
CGCCACTTCT TCGGGGTCGA CGACGTCCTG CGGGTGATCG ACCATCTCAC CCGCTACAAG
CTCAACCACC TGCATCTGCA CCTCACCGAT GACCAGGGTT GGCGGATCGC GGTCGAGTCC
CGGCCGCGAC TGACCACAAT CGGCGCCCAC ACGGCGGTCG GTGGCTCCCC CGGCGGGTGG
TACACCCCGG CCGACTACCA GCGGATCGTC GCGTACGCGG GCGAGCGGCA CCTCACCGTC
GTTCCGGAGA TCGACCTGCC GGGGCACACC AACGCCGCGC TGACCGCCTA CCCGGAGCTG
GCCCCGGAGG GGACCACACC GGTGCCCTAC ACCGGCACCG ATGTCGGCTT CAGCTACTTG
GACCCGACCA ACGCACAGAC GTACGAATTC GTCGCCGACG TGCTGGGGGA GGTCGCCGCG
CGCACTCCCG GGCCGTTCCT GCACATCGGC GGCGACGAGG CCTTCGCAGT AAAGGGAGCG
GCGTACACCG GATTCGTCGA GCGGGCGCAA CGCATCGTGG CCGAGTTCGG CAAGACGGTG
GTGGGCTGGC ACCAGCTGGC TCCGGCAGCC CACAGCGAGG GGCGGGTACT CCAGTGGTGG
GGCACCGACG GCGTCGACCC GACGACCGCC GACGCGGTCC GCCGAGGCGC CCGGCTGATC
CTCTCCCCCG GCAACCACGC GTATCTGGAC ATGAAGTACA CCTCCGACAC CCCGATCGGG
CATGACTGGG CTGGCCTGAT CGACGTACGG CGGGCGTACG ACTGGGATCC GGCGACCCGG
GTGACGGAGG TGCCGGCGGA GGCGGTGCTG GGCGTGGAGG CCCCGCTCTG GACCGAGTCG
GTCACCTCGC TGGCGGAGAT CGAGTTCATG CTCCTGCCCC GGCTGCCCGC CATCGCGGAG
CTGGGCTGGT CACCGCGGGC CACCCACGAC TGGGCGGCGT TCCGCACCCG GTTGGCCGGG
CACGGACCCC GCTGGACGGC AGCCGGGATC GCCTTCCACC GCTCCCCCGA GATCCCCTGG
CCAACAACGC CCGCCGACCC GCCGACGGCG CAGGTCGCGC CACCCGCGCC GCGTCCCCGT
ACCGGGCGGG GATAG
 
Protein sequence
MHPTPATSPQ QPEHRPAPAT SGATGTVPAT DGGTSIPTTS RPATAQPAAG ELARLAAHEA 
GTGLAPAPVR LGDVVPAPEQ VRPDPRADFT LSADTTVRVS PEPDARAVAE RLADLLRPAT
GYRLPITEAQ HPELADDIAL VLVQQPDLGP EGYRLDVTPT GVRISAATAA GLHHGIQTLR
QLLPATIESS TPVRAAWTLP GGSITDRPRF PYRGAMLDVA RHFFGVDDVL RVIDHLTRYK
LNHLHLHLTD DQGWRIAVES RPRLTTIGAH TAVGGSPGGW YTPADYQRIV AYAGERHLTV
VPEIDLPGHT NAALTAYPEL APEGTTPVPY TGTDVGFSYL DPTNAQTYEF VADVLGEVAA
RTPGPFLHIG GDEAFAVKGA AYTGFVERAQ RIVAEFGKTV VGWHQLAPAA HSEGRVLQWW
GTDGVDPTTA DAVRRGARLI LSPGNHAYLD MKYTSDTPIG HDWAGLIDVR RAYDWDPATR
VTEVPAEAVL GVEAPLWTES VTSLAEIEFM LLPRLPAIAE LGWSPRATHD WAAFRTRLAG
HGPRWTAAGI AFHRSPEIPW PTTPADPPTA QVAPPAPRPR TGRG