Gene Namu_3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3938 
Symbol 
ID8449557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4346868 
End bp4349099 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content74% 
IMG OID645042983 
Producttransglutaminase domain protein 
Protein accessionYP_003203219 
Protein GI258654063 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.344249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCCC TGCTCGGCGC CCTGGCCGTC CTGGGCGGCA CCCTGGCCAT CCCGCCGATG 
ATCAGTGGTA GCTCCTGGTT CTGGCCCACC GCCGAGGTGG TGCTGGTCAT CTGGCTGGTC
GGGGTGGGCG CGCGGCTGGC CCGGATCCCG ATCGCGGCCG TCATCGCGAT GCAGGCGGCC
GCCGCGGCGA TCGCCGTCAC CGCCCTGTTC ACCGCGCGCG GGTGGGGTGG GGTGATCCCC
AACGGTGCGG TACTACAGGA GGCGGGCGAG CTGCTGGACG GCGCCTGGAC CCAGATCCGT
ACCTCGGTCT CCCCCGCCCC GTCCTCGCCC GAACTGTCCT TCCTGATCTG CGTATCGGTG
GCGGCCACCG CGTTCGTCGT CGACCTGCTC ATCACCGCGT GCCGGGCGCC GGCGCTGGTC
GCCCTGCCGC TGCTGTGCCT GTACGCGGTG CCAGCGTCCA TCGACGTCTC GTTGCTGCCC
TGGCCGGCCT TCGCCATCCC GGCGGTGCTG TACGCGCTGG TGTTGGTCGC CGACGGCCTG
TCCGGCCGGG GTGCGGGGGC CGGGGCCCGG GCCGCCCAGG CGGGTTCCGG CCTGGTCCTG
GCCTGCGTGG CCACGGTGAT CGCGCTGGTG GTGGCGGACT CGGTCACCGG TGTCGGCACC
ACCGGCCGGC TCCCGCGGAC CGGCACCGGC GCCAGCACCG GCATCGGCCT GTCCCCCTTC
ACCTCGCTCG AGGGCAACCT GCAGCGGGGT GAGCCGGTGG ACCTGCTGCG GGTCAGCGGG
CTGCCCCAAC CCGAGTACCT GCGCACCGTC GGGCTGCAGC AGTGGACCCC GAACGAGGGC
TGGTCGGTCG ACCAGCTCGA CGACGGACCG CTGCCGCTGC AGCCGATCAC GGTCGGCGAG
ACCCAGGTGA CGGTCACCCC GCTGGACTAC CGGGACATCT TCCTGCCCGT GTACAACGGG
GTCCGCTCGC TGCAGGGGGT CGATCAGGGC TGGTCGTTCG ACTCGGCCCT GGAATCGGTG
CACCGGGCCG AGGCCGTCAC CCCCGACCCG TACCAGGTCA CGGCGATCCT CTCCACACCC
TCGGCCGACG AGCTGCGCAC CGACAGCGTG ACCGGCGGCA GCGACCTGCT CGACACCGGC
GACCTGCGAC CGGAGGTGAT CGCCCGGGCC ACCGAGATCA CGGCCGGGGC CACCACCGCG
TTCGACAAGG CCAGCGCGCT GGTCTCCTAC TTCACCGACC CGGCCAACGG CTTCACCTAC
TCCCTCGACG TGCCCACCGG CACCACCGGC GACAAGTTGC TGGACTTCCT GGACCTCAAG
CAGGGATTCT GCGAGCAGTA CGCCTCCGCG ATGGCCGTCA TGCTGCGCGC GGTCGGGGTG
CCCGCCCGGG TCGCCGTCGG CTTCACCCAG GGTCGGCTGG ACGCCTCCGG CGACTACGTC
ATCACCAGCA ACGACGCGCA CGCCTGGGTG GAGGTGCCGT TCAGCGAGGC CGGGTGGGTG
GAGTTCGATC CGACGCCGCT GGGTGGCGGG CAGGGCGGCC AGCAGGGCTT CACCACCGCC
TCCGGTGAGC CGACGCCGAC GCCGACGGCC AGCGCGGCGA CCTCGGCCCC GCAGACCGAG
CAGGAGCTGG GCGCCAACCG GGCACCGACG GCGGCCGCGA CCACGTCGGC CGGCAGTGCC
GCCGGGAGTG GCGCCGACGA CGGCCCGATC GTGCCGGCCG GCGTCTGGTG GGCGCTGGCC
GTCCTGCTCG TCATCGGCGG CGCGTTGGCC GGTCCGACGC TGGTCCGGCG CCGCCGCCGG
GACCAACGGC TGGCCACCGC CGACGCCGGC GGACCCGGCG CGGCCGCCGC GGCGTGGCGG
GAGATCGAGG ACCTGGCCGT CGACCACGGC ATCGCCCTGG ACCCGGCCCA GTCCGCGCGA
TCCTGCGCGA ACCGGCTGGC CAAGTCGGCC AAGCTCAGCG AGACCGGGCG GGCTCAGCTA
CGGGCGGTGG TCTCGGCGGC CGAGCAGGGC TGGTACGCCG GCGACGCACC GACCGTCACA
CCGTCCGTGG GTAGGGTCTC GGTGGCCGAG CGGACCACGC CCACCGTCCC GAGCACGACC
ACGGCCGGAT CGAGCTCGCC CATGGGTGAC GCGGCCCGCA CGCTGGCCGT CGACCTCGGC
CACGCCGCAC CCCTGTCACT GTCAGAACGG CTGGTGCCGC GCTCGGTGCG ACCGGCCTGG
TGGCGGGGCT GA
 
Protein sequence
MPALLGALAV LGGTLAIPPM ISGSSWFWPT AEVVLVIWLV GVGARLARIP IAAVIAMQAA 
AAAIAVTALF TARGWGGVIP NGAVLQEAGE LLDGAWTQIR TSVSPAPSSP ELSFLICVSV
AATAFVVDLL ITACRAPALV ALPLLCLYAV PASIDVSLLP WPAFAIPAVL YALVLVADGL
SGRGAGAGAR AAQAGSGLVL ACVATVIALV VADSVTGVGT TGRLPRTGTG ASTGIGLSPF
TSLEGNLQRG EPVDLLRVSG LPQPEYLRTV GLQQWTPNEG WSVDQLDDGP LPLQPITVGE
TQVTVTPLDY RDIFLPVYNG VRSLQGVDQG WSFDSALESV HRAEAVTPDP YQVTAILSTP
SADELRTDSV TGGSDLLDTG DLRPEVIARA TEITAGATTA FDKASALVSY FTDPANGFTY
SLDVPTGTTG DKLLDFLDLK QGFCEQYASA MAVMLRAVGV PARVAVGFTQ GRLDASGDYV
ITSNDAHAWV EVPFSEAGWV EFDPTPLGGG QGGQQGFTTA SGEPTPTPTA SAATSAPQTE
QELGANRAPT AAATTSAGSA AGSGADDGPI VPAGVWWALA VLLVIGGALA GPTLVRRRRR
DQRLATADAG GPGAAAAAWR EIEDLAVDHG IALDPAQSAR SCANRLAKSA KLSETGRAQL
RAVVSAAEQG WYAGDAPTVT PSVGRVSVAE RTTPTVPSTT TAGSSSPMGD AARTLAVDLG
HAAPLSLSER LVPRSVRPAW WRG