Gene Namu_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3816 
Symbol 
ID8449435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4190275 
End bp4191426 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content74% 
IMG OID645042866 
Productglycosyltransferase, MGT family 
Protein accessionYP_003203102 
Protein GI258653946 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0000289865 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0868297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATCC TGGTGGCCGC CATTCCCGCC TACGGGCACC TGTTTCCGGT CCTGCCGCTG 
GCGGCGGCGT GTGCGGCCGC CGGGCATGAG GTCACGGTGG CCACCGGCGC CCCCTTCCTG
GGTCGCCTGC CCTGGCCGAC GGTGCCCGGG ATCGCGCCCG GCGTCGAGCT CGACGCCGTC
GTCCGGGAGA CCCAGCGGCG CCACCCGGAC CGGCACGGAC CCGACCTGGC CGTGGCCATG
TTCGCCGACA CCACCGCCGA GCTGGTCAGT GGCGCGCTGG ACGAGGTGTT CGCGCAGTCA
CGACCGGATC TGGTGATCTT CGAGGCGATG AACGTCGGGG CGGCCGTCGC CGCCGACCGG
CACGACATCC CGGCGGTCGG CTTTGCGATC GGCCTGGCTC CGTTCGTCGT CGACCTCATC
CACACCGCGG TCCGCGACTT CCAGGGCCAC TTCTGGATCG ACCGCGGCCG GCCGATGCCC
ACCGGGACCG GGCTGCTCGG CGCCGCCCTG ATCGACCCGT CGCCGCCGTC GTTGTCCGTC
ACCGCCGGGC CGGCGCCGGC CGTCCCGCGC TGGCCGATCC GCTCGGTGGC CTACACCGAC
GGCTCGTCTC CGATCCCGGA CTGGCTGGTG GGCAACCGAT CTCGACCCGT CGTCTATCTC
ACCCTGGGCA CCGTGTCGTT CGGGGCGGTG GAGGTGATCC GGCGGGCCAT CGACCAGATC
GAGCCGCTGC CGGTCGACCT GCTGGTCGCC GTCGGGCCCG AGGGCGACCC GGCCGCCCTC
GGTGCCGTCG GTGCGCGCGT GCATGTCGAA CGGTTCGTGC CGCAGGCCCA GGTGCTGGAG
CGGATCGAGG TGATCGTGCA CCACGGTGGC ACCGGCACCG TGCTCGGTGC GGCCGCCGCG
GGCGTGCCTC AGCTGATCCT GCCGCAGGGT GCCGACCAGT TCCTCAACGC CGACCTGGTC
ACCCGGGCCG GCATCGGTCG GGCACTGCGC GCGGACGAAC AGGTGGGCGG GGCGATCACC
GACGCCGTCC GGGTCCTGTT GGACGCCGGT CCGCCGCGGC AGCGCGCCGC GCAGCTGCGG
GCCGAGATCG CCGGGATGCC CGCGCCCGAG ACGATCGTGC CGCGACTGGA GAAGCTGGTC
GCCGGCGGCT GA
 
Protein sequence
MRILVAAIPA YGHLFPVLPL AAACAAAGHE VTVATGAPFL GRLPWPTVPG IAPGVELDAV 
VRETQRRHPD RHGPDLAVAM FADTTAELVS GALDEVFAQS RPDLVIFEAM NVGAAVAADR
HDIPAVGFAI GLAPFVVDLI HTAVRDFQGH FWIDRGRPMP TGTGLLGAAL IDPSPPSLSV
TAGPAPAVPR WPIRSVAYTD GSSPIPDWLV GNRSRPVVYL TLGTVSFGAV EVIRRAIDQI
EPLPVDLLVA VGPEGDPAAL GAVGARVHVE RFVPQAQVLE RIEVIVHHGG TGTVLGAAAA
GVPQLILPQG ADQFLNADLV TRAGIGRALR ADEQVGGAIT DAVRVLLDAG PPRQRAAQLR
AEIAGMPAPE TIVPRLEKLV AGG