Gene Information |
Locus tag | Namu_3901 |
Symbol | |
ID | 8449520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4302880 |
End bp | 4304319 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645042947 |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003203183 |
Protein GI | 258654027 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | |
|
|
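The coordinate and length fields above are mutually consistent, which a short check can make explicit. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 4302880
END_BP = 4304319


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the stop codon contributes one residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1440 479"
```

Under those assumptions, 1440 bp corresponds to 480 codons, and dropping the stop codon gives the 479 aa listed as Protein Length.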
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00796853 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0223593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGAT TTCCCGAGGG GTTCCTGTGG GGCGGCGCGG TGGCGGCGCA CCAGTTCGAG GGCGGCTGGG ACGCCGGCGG CAAGGGACCG AACGTCGTCG ACGTGCTGAC CGCCGGCGCG CACGGCGTGC CCCGCCGGCT GACCGACTCC GTCGAGCCGG GCACGTTCTA CCCCAACCAC GAGGCGATCG ACTTCTACCA CCGGTTCCGT TCGGACATCG CGCTGTTCGC CGAGCTCGGA CTGCGCTGCT TTCGCACGTC CATCTCCTGG GCCCGGATCT TCCCCCGCGG GGACGAGACC GAGCCCAACG AGGAGGGTCT GGCCTTCTAC GACGCCGTGT TCGACGAGCT GATCGCGCAC GGCATCGCCC CGGTCATCAC CCTGTCGCAC TTCGAGTTGC CGCTGCACCT GGCCCGCGAG TACGGGGGCT TTCGCAACCG CGCCCTGGTC GAGCTGTTCG CCCGGTTCGC CGAGGTGTGC TTTCGCCGGT ACCGGCACAA GGTCCGGTAC TGGATGACCT TCAACGAGAT CAACAACCAG ATGGACACCG ACAACTGGCT GTTCCTGTGG ACCAACTCCG GAGTGCTGGT CGGACCGGAG GAGAACGCCC GCGAGGTGAT GTTCCAGACC GCCCACCACG AGCTGCTGGC CAGCGCCAGG GCGGTCGCCA TCGGGCATGC GATCGACCCC GACCTGCAGA TCGGGGCGAT GGTCTCGCAC GTGCCGATCT ACCCGTTCTC CTGCGACCCG CAGGACGTGA TGGCCGCCCA GATCGCGATG CGGCAGCGGT TCTTCTTCCC CGACGTGCAG GTGCGCGGCG CCTACCCGGC CTACGCGCTC AAGGAGTTCG AGCGCGAGGG CTACCGGATC GCGATGGATC CGCAGGACGC GCAGATCCTG GCCGCCGGCA CGGTCGACTA CCTGGGCTTC AGCTACTACA TGTCCACCGT GGTCAAGGCT GACGCGGTGA ACGAGAACAC CGGCGAGTCG GTCGATTTCA CCCTGCCCAA CGGGGTGCCC AACCCGTACC TGACGGCCAG CGACTGGGGC TGGCAGATCG ACCCGGTCGG CCTGCGGTAC ACGCTGAACA CCCTGTCCGA GCGCTACCAG CTACCGTTGT TCATCGTCGA GAACGGTTTC GGCGCGGTCG ATGTGGTCGC CGACGACGGC ACCATCGACG ACGCCGAGCG GATCGACTAC CTGCGCGCGC ACATCGAGGC GATGCGGGAC GCGATCGACC AGGACGGCGT CGACCTGATC GGCTACACCC CCTGGGGCAT CATCGATCTC GTCTCGTTCA CGACGGGGGA GATGCGCAAG CGGTACGGGA TGATCCACGT CGACCGGGAC AACGAGGGCC ACGGCACGCT GGCCCGGACC CGCAAGCGGT CCTTCGGCTG GTACCGGGAC GTCATCGCCG CCAACGGCGC CGCGCTCTAG
|
Protein sequence | MSGFPEGFLW GGAVAAHQFE GGWDAGGKGP NVVDVLTAGA HGVPRRLTDS VEPGTFYPNH EAIDFYHRFR SDIALFAELG LRCFRTSISW ARIFPRGDET EPNEEGLAFY DAVFDELIAH GIAPVITLSH FELPLHLARE YGGFRNRALV ELFARFAEVC FRRYRHKVRY WMTFNEINNQ MDTDNWLFLW TNSGVLVGPE ENAREVMFQT AHHELLASAR AVAIGHAIDP DLQIGAMVSH VPIYPFSCDP QDVMAAQIAM RQRFFFPDVQ VRGAYPAYAL KEFEREGYRI AMDPQDAQIL AAGTVDYLGF SYYMSTVVKA DAVNENTGES VDFTLPNGVP NPYLTASDWG WQIDPVGLRY TLNTLSERYQ LPLFIVENGF GAVDVVADDG TIDDAERIDY LRAHIEAMRD AIDQDGVDLI GYTPWGIIDL VSFTTGEMRK RYGMIHVDRD NEGHGTLART RKRSFGWYRD VIAANGAAL
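The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, that strips the spacing, computes a GC fraction for comparison with the listed GC content, and checks that a sequence begins with a start codon and ends with a stop codon under translation table 11. The codon sets are written from my understanding of NCBI table 11 and should be treated as an assumption. Note that the record's sequence begins with GTG, an alternative start codon that is translated as methionine when it initiates a protein, which matches the leading M in the protein sequence.

```python
# Start and stop codons assumed for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_ends(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with a table 11
    start codon, and ends with a table 11 stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS_TABLE_11
        and seq[-3:] in STOP_CODONS_TABLE_11
    )


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence, for illustration.
    head = clean_sequence("GTGAGCGGAT TTCCCGAGGG")
    print(head)               # prints "GTGAGCGGATTTCCCGAGGG"
    print(gc_fraction(head))  # prints "0.65"
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the 68% reported above; the sketch does not assert that figure itself.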