Gene Namu_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4209 
Symbol 
ID8449835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4653281 
End bp4654321 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content67% 
IMG OID645043258 
Productglycosyl transferase family 2 
Protein accessionYP_003203487 
Protein GI258654331 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.220629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCGG ATCTCCGGAT CGCCGCGATC GTTCCGTGCC ACAACGAAGA GGCTGCGGTG 
GGCCAGGTGG TCACCGACCT GCGCGCTGCC GTGCCCGGTA TGGCGATCTA CGTCTACGAC
AACCGATCGA CCGACCGGAC GGTCGAGGTC GCTCAAGCGG CCGGCGCCAT CGTGCGGCGT
GAAGAGGTCA AGGGCAAGGG CAACGTGGTC CGTCGAGCAT TCGCCGACAT CGAGGCCGAC
GTGTACCTGC TCATCGACGG CGACGACACC TACGACGCCT TCGCCGCCCC GCGGATGATC
GACACCCTGC TCGCGGGGCC GTACGACCAC GTGCTCGGTG TGCGCAAGCA GACCACCGAC
TCCGCCTACC GGCCGGGCCA CTCGGCCGGC AACAAGCTAT TCAACAGGCT GGTCACGACC
GCCTTCGGCA CCCCGGTCAG CGACATGCTC AGCGGCTATA GGATCTTCTC CCGACGATTT
GTGAAATCGT TCCCGGCGGT GTCCCGCGAA TTCGAGATCG AGACCGAGCT CACCGTGCAC
ACGATGAGCC TGCGCGTGCC GCAGACCGAA GTGCCGGTGG ACTTCAAGGA CCGCCCCGAA
GGCAGCGAGA GCAAGCTCAA CACGTACCGG GACGGATTCA AGATCCTGTC CTTGATCTTC
CAACTCATCC GGCACGAACG TCCGCTGGCG TTCCACACGA TCACCGCCGG TCTCATCGCG
ATCATCGCGC TCATCCTCGG CGTCCCGCTG GTCGTCGAGT TCGGCCGGAC CGGGCTGGTC
CCGCGGTTCC CGACCGCGTT CCTGGCCGCA TCCCTGATGG TGATCGCGGC GCTGGTCCTG
ACCATCGGCG TCGTGTTGGA CGGCATTACC CGCAGCCGGC GCGAATCGGC CCGGTTGGTG
TATCTGGGCT ACGAGGCACC CGGCCGGCCC CGGCACTCTT CGCCCGCCCG GCACGACCGG
CCGGTCACCG GGCCCGAAAC GCGTCAGCCG ACCGCGCCAT TGCATCAGCA AGGGCGACCC
GCCACGGTCG TAGGGGGTTA A
 
Protein sequence
MYPDLRIAAI VPCHNEEAAV GQVVTDLRAA VPGMAIYVYD NRSTDRTVEV AQAAGAIVRR 
EEVKGKGNVV RRAFADIEAD VYLLIDGDDT YDAFAAPRMI DTLLAGPYDH VLGVRKQTTD
SAYRPGHSAG NKLFNRLVTT AFGTPVSDML SGYRIFSRRF VKSFPAVSRE FEIETELTVH
TMSLRVPQTE VPVDFKDRPE GSESKLNTYR DGFKILSLIF QLIRHERPLA FHTITAGLIA
IIALILGVPL VVEFGRTGLV PRFPTAFLAA SLMVIAALVL TIGVVLDGIT RSRRESARLV
YLGYEAPGRP RHSSPARHDR PVTGPETRQP TAPLHQQGRP ATVVGG