Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3938 |
Symbol | |
ID | 8449557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4346868 |
End bp | 4349099 |
Gene Length | 2232 bp |
Protein Length | 743 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 645042983 |
Product | transglutaminase domain protein |
Protein accession | YP_003203219 |
Protein GI | 258654063 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.344249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCCC TGCTCGGCGC CCTGGCCGTC CTGGGCGGCA CCCTGGCCAT CCCGCCGATG ATCAGTGGTA GCTCCTGGTT CTGGCCCACC GCCGAGGTGG TGCTGGTCAT CTGGCTGGTC GGGGTGGGCG CGCGGCTGGC CCGGATCCCG ATCGCGGCCG TCATCGCGAT GCAGGCGGCC GCCGCGGCGA TCGCCGTCAC CGCCCTGTTC ACCGCGCGCG GGTGGGGTGG GGTGATCCCC AACGGTGCGG TACTACAGGA GGCGGGCGAG CTGCTGGACG GCGCCTGGAC CCAGATCCGT ACCTCGGTCT CCCCCGCCCC GTCCTCGCCC GAACTGTCCT TCCTGATCTG CGTATCGGTG GCGGCCACCG CGTTCGTCGT CGACCTGCTC ATCACCGCGT GCCGGGCGCC GGCGCTGGTC GCCCTGCCGC TGCTGTGCCT GTACGCGGTG CCAGCGTCCA TCGACGTCTC GTTGCTGCCC TGGCCGGCCT TCGCCATCCC GGCGGTGCTG TACGCGCTGG TGTTGGTCGC CGACGGCCTG TCCGGCCGGG GTGCGGGGGC CGGGGCCCGG GCCGCCCAGG CGGGTTCCGG CCTGGTCCTG GCCTGCGTGG CCACGGTGAT CGCGCTGGTG GTGGCGGACT CGGTCACCGG TGTCGGCACC ACCGGCCGGC TCCCGCGGAC CGGCACCGGC GCCAGCACCG GCATCGGCCT GTCCCCCTTC ACCTCGCTCG AGGGCAACCT GCAGCGGGGT GAGCCGGTGG ACCTGCTGCG GGTCAGCGGG CTGCCCCAAC CCGAGTACCT GCGCACCGTC GGGCTGCAGC AGTGGACCCC GAACGAGGGC TGGTCGGTCG ACCAGCTCGA CGACGGACCG CTGCCGCTGC AGCCGATCAC GGTCGGCGAG ACCCAGGTGA CGGTCACCCC GCTGGACTAC CGGGACATCT TCCTGCCCGT GTACAACGGG GTCCGCTCGC TGCAGGGGGT CGATCAGGGC TGGTCGTTCG ACTCGGCCCT GGAATCGGTG CACCGGGCCG AGGCCGTCAC CCCCGACCCG TACCAGGTCA CGGCGATCCT CTCCACACCC TCGGCCGACG AGCTGCGCAC CGACAGCGTG ACCGGCGGCA GCGACCTGCT CGACACCGGC GACCTGCGAC CGGAGGTGAT CGCCCGGGCC ACCGAGATCA CGGCCGGGGC CACCACCGCG TTCGACAAGG CCAGCGCGCT GGTCTCCTAC TTCACCGACC CGGCCAACGG CTTCACCTAC TCCCTCGACG TGCCCACCGG CACCACCGGC GACAAGTTGC TGGACTTCCT GGACCTCAAG CAGGGATTCT GCGAGCAGTA CGCCTCCGCG ATGGCCGTCA TGCTGCGCGC GGTCGGGGTG CCCGCCCGGG TCGCCGTCGG CTTCACCCAG GGTCGGCTGG ACGCCTCCGG CGACTACGTC ATCACCAGCA ACGACGCGCA CGCCTGGGTG GAGGTGCCGT TCAGCGAGGC CGGGTGGGTG GAGTTCGATC CGACGCCGCT GGGTGGCGGG CAGGGCGGCC AGCAGGGCTT CACCACCGCC TCCGGTGAGC CGACGCCGAC GCCGACGGCC AGCGCGGCGA CCTCGGCCCC GCAGACCGAG CAGGAGCTGG GCGCCAACCG GGCACCGACG GCGGCCGCGA CCACGTCGGC CGGCAGTGCC GCCGGGAGTG GCGCCGACGA CGGCCCGATC GTGCCGGCCG GCGTCTGGTG GGCGCTGGCC GTCCTGCTCG TCATCGGCGG CGCGTTGGCC GGTCCGACGC TGGTCCGGCG CCGCCGCCGG GACCAACGGC TGGCCACCGC CGACGCCGGC GGACCCGGCG CGGCCGCCGC GGCGTGGCGG GAGATCGAGG ACCTGGCCGT CGACCACGGC ATCGCCCTGG ACCCGGCCCA GTCCGCGCGA TCCTGCGCGA ACCGGCTGGC CAAGTCGGCC AAGCTCAGCG AGACCGGGCG GGCTCAGCTA CGGGCGGTGG TCTCGGCGGC CGAGCAGGGC TGGTACGCCG GCGACGCACC GACCGTCACA CCGTCCGTGG GTAGGGTCTC GGTGGCCGAG CGGACCACGC CCACCGTCCC GAGCACGACC ACGGCCGGAT CGAGCTCGCC CATGGGTGAC GCGGCCCGCA CGCTGGCCGT CGACCTCGGC CACGCCGCAC CCCTGTCACT GTCAGAACGG CTGGTGCCGC GCTCGGTGCG ACCGGCCTGG TGGCGGGGCT GA
|
Protein sequence | MPALLGALAV LGGTLAIPPM ISGSSWFWPT AEVVLVIWLV GVGARLARIP IAAVIAMQAA AAAIAVTALF TARGWGGVIP NGAVLQEAGE LLDGAWTQIR TSVSPAPSSP ELSFLICVSV AATAFVVDLL ITACRAPALV ALPLLCLYAV PASIDVSLLP WPAFAIPAVL YALVLVADGL SGRGAGAGAR AAQAGSGLVL ACVATVIALV VADSVTGVGT TGRLPRTGTG ASTGIGLSPF TSLEGNLQRG EPVDLLRVSG LPQPEYLRTV GLQQWTPNEG WSVDQLDDGP LPLQPITVGE TQVTVTPLDY RDIFLPVYNG VRSLQGVDQG WSFDSALESV HRAEAVTPDP YQVTAILSTP SADELRTDSV TGGSDLLDTG DLRPEVIARA TEITAGATTA FDKASALVSY FTDPANGFTY SLDVPTGTTG DKLLDFLDLK QGFCEQYASA MAVMLRAVGV PARVAVGFTQ GRLDASGDYV ITSNDAHAWV EVPFSEAGWV EFDPTPLGGG QGGQQGFTTA SGEPTPTPTA SAATSAPQTE QELGANRAPT AAATTSAGSA AGSGADDGPI VPAGVWWALA VLLVIGGALA GPTLVRRRRR DQRLATADAG GPGAAAAAWR EIEDLAVDHG IALDPAQSAR SCANRLAKSA KLSETGRAQL RAVVSAAEQG WYAGDAPTVT PSVGRVSVAE RTTPTVPSTT TAGSSSPMGD AARTLAVDLG HAAPLSLSER LVPRSVRPAW WRG
|
| |