Gene Namu_4815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4815 
Symbol 
ID8450445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5360055 
End bp5362382 
Gene Length2328 bp 
Protein Length775 aa 
Translation table11 
GC content78% 
IMG OID645043854 
Producttransglutaminase domain protein 
Protein accessionYP_003204079 
Protein GI258654923 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC GGGTTCGCGG GCCGGGCCGC AGCCGCGTCG TCGGCACCGG CCGGTGGTGG 
CGCGCGGCCG ATCTGGCGGT GCTGGCCGGC CTGGTGACCG TGGGCTCGAT GCTGCTGGTG
CCCGCCTACG GGTCGCCGGC GCCGGTGGCC GCCGCGGCGG TCGGCGCCGC GGTGCCGGCC
ACAGTCTTCG CGGTGCTGGA CCGCTGGCCC GGGCGTTCGG CGCTCACCCG ATGGCTGGCC
GCGGCCGTGA TCGGGGCCGG TGTGCTGGTG CTCGGCGCCC TGGTCGTCGC GCCGCAGTCC
CGGGTGCTGG GGGTGCTGCC GTCGGCCGAC GCGATCGCGG CGGTCCTGGC CGGCGCGGTG
GACGGCTGGC GCGACCTGCT GACCGTGGCC ACCCCGACCG GGGTGGCGGG CGGGTTGCTG
GTGCCACCGC TGCTGATCGC CACCGCCGGC ACCACCCTGG CCGCGGTCCT GGCCGGCACC
CGCCGGCCCG CCCTGGCCCT GGTCGGGCCG GCCGCCGCCG CGGTCGCCTA CGCCCTGTTC
GCCGACGTCA CGCTCGATCC CTGGACCGGT GTCGTGGTCG GCGGGTCCCT GCTGGTCGGC
GGGCTGGGAT GGGTCAGCTG GATCGGCGGC CGGACCGCCC GCCGGGCCGA ACGGGCGGCC
GCGGCGTCCC AGCTGGCCGC GGACCGGGTG GCCGGCGCGG ACGGGGCGCT GGGGTTGCGC
CGGCTTCTGG TCGCGGGCGC GATCCTGGCG GTGGCGGCCG TGGTCGGCGG GCTGGTGGCC
GCCGCGGGGA CCCCCGACCG GGCAGCCCTG CGTGAGGTGG TGCGGGCGCC GGTCGACCCG
TCGACGTTCG AGAGCCCGCT GGCTCAGTTC CGTGGTTTCA CCAAGCAGCA CGCCGACGAT
GTCCAGCTCG TCGTGCAGGG GCTGCCGGCC GGCGCCCGGT TGCGGTTGGC CAGCCTGGAC
GACTACGACG GCCGGCAGTT CCGGCTCAGT GACACCGCCG GTGCGTTCGT GCGTATCGGT
CCGGAACGGC CGAGTGCGGC GGCCACCGCC TCGGTGCCGG TGACCGTGCA GGTGCGGGAC
TACGCCGGCC GGTTCCTGCC CCTGCCGGGG GCGATCGAGC GCCTGGACTT CGGCGGCGCG
CGGGCCGACG ACCTGGCCGC CGACCTGCGG TACTCGGCGG GCGAGGCCAC CGGGCTGCTG
CCCGGCGGGT GGCAGCCGGG TGATCAGTAC CGGGTGGTGG CCGGGATCGC GGCGCAGCCC
ACCGTCGACC AGCTGGCCGG GGCCCGGCCG AGCCCGGTCG CCCTGCCCGC CGCCGTGACC
CTGCCCGACC TGCTGCGGTC ACAGACCGAG CGGTACACCA CCGGGGCGCT CACCCCGGCC
GCCCAGGTGG AGGCCATCCG GGCCGGCCTG GCCCGGGACG GGTTCTTCAG CCACGGCCGC
GCCGGCGAGC CGTCCGCACC GGCCGGGCAC GGCCTGGACC GGCTGACCGG CATGGTGGGC
AGCGGGTCGA TGCTGGGCGA CCAGGAGCAG TACGCCGCCC TGATGGCGGT GATGGTGCGC
TCACTGGGCA TCCCGGCCCG GGTGGTGGTC GGCTTCGTCC CCGGCACGCC CAGCGGGGAC
GCCCCGGTCG AGCTGCGCGG GCAGGACATC ACCGCCTGGG TCGAGGTCCC GTTCGACGGC
TTCGGCTGGG TCGCGTTCGA CCCGACCCCC TCGCCGGACA AGCCGGTGAC CGACGTCCAG
GAGCGGGCCC AGACCGAACG GCGGGCGGTC TCGGTCGAGG AGCCGCCGGC CCTGCCGCAG
ATCCCGCCGG ACACCGCCGA CACCGACTCG GCCGAGCAGC AGCCGCAGGA TCCGAACCCG
CCGGCCCCGG CCGAATCACC GACCTTCCTG CCCGCACTGG TGCTGACCAT GCTGGCCTGG
ACCGCGGTGG TGCTGGCCCT GGTGGCGGCG CCGGTGGTGG CGATCCTGCT GATCAAGTCC
CGCCGGCGGC GGCGCCGGCT GGCCGCCCCG ACGCCGGCCG ACCGGATCAC CGGTGGCTGG
CAGCAGCTGC TGGACACCGC GGTGGACACC GGCTACCGCC CCACCCCGTG GCACACCCGC
ACCGAGGCGG CCCGTGACCT GTCCGGGGCC GGGGTGCTGC AGGTGGACTG GCTGGCCCCG
GCGGCCGACG CCGCCCAGTT CTCGCCGACC CCGGTCGAGG AGGACCGGGC CCGCAGTTAC
TGGCGCGAGG TGGACGACCG TAGTGCCGAG CTGCTCGGCG GGCTGGGCTT CTGGCGCCGG
TGGCGAGCCC GGCTCTCCCT GGCGTCGCTG CGCCGCCGGG ACCGCTGA
 
Protein sequence
MSGRVRGPGR SRVVGTGRWW RAADLAVLAG LVTVGSMLLV PAYGSPAPVA AAAVGAAVPA 
TVFAVLDRWP GRSALTRWLA AAVIGAGVLV LGALVVAPQS RVLGVLPSAD AIAAVLAGAV
DGWRDLLTVA TPTGVAGGLL VPPLLIATAG TTLAAVLAGT RRPALALVGP AAAAVAYALF
ADVTLDPWTG VVVGGSLLVG GLGWVSWIGG RTARRAERAA AASQLAADRV AGADGALGLR
RLLVAGAILA VAAVVGGLVA AAGTPDRAAL REVVRAPVDP STFESPLAQF RGFTKQHADD
VQLVVQGLPA GARLRLASLD DYDGRQFRLS DTAGAFVRIG PERPSAAATA SVPVTVQVRD
YAGRFLPLPG AIERLDFGGA RADDLAADLR YSAGEATGLL PGGWQPGDQY RVVAGIAAQP
TVDQLAGARP SPVALPAAVT LPDLLRSQTE RYTTGALTPA AQVEAIRAGL ARDGFFSHGR
AGEPSAPAGH GLDRLTGMVG SGSMLGDQEQ YAALMAVMVR SLGIPARVVV GFVPGTPSGD
APVELRGQDI TAWVEVPFDG FGWVAFDPTP SPDKPVTDVQ ERAQTERRAV SVEEPPALPQ
IPPDTADTDS AEQQPQDPNP PAPAESPTFL PALVLTMLAW TAVVLALVAA PVVAILLIKS
RRRRRRLAAP TPADRITGGW QQLLDTAVDT GYRPTPWHTR TEAARDLSGA GVLQVDWLAP
AADAAQFSPT PVEEDRARSY WREVDDRSAE LLGGLGFWRR WRARLSLASL RRRDR