Gene Namu_4194 details


Gene Information

Locus tag: Namu_4194
Symbol:
ID: 8449820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4633041
End bp: 4634129
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 67%
IMG OID: 645043243
Product: glycosyl transferase family 2
Protein accession: YP_003203472
Protein GI: 258654316
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
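
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4633041, 4634129)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.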


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTC AGCACGATCA GCAGACGCAG GTCGCGGTTG TCCTGCGGAC GAGGAATCGT 
CCTCTGCTCC TAGCTCGTGC ACTGGCCAGC GCGGCCGCTC AGACGTACGC CGACTACGTG
GTGATCGTGG TCAACGACAC TGGGGATCCG GGTCCGGTCG ACGACGCGGT CGCCCAGGTC
GCCGACCGCG CGCTGGGTCG CTTCCAGGTG GTGCACAACA CCGTCTCGCG AGGACGCGAG
GCGGCACTCA ACATCGGGCT GGAGGCCAGC TCGTCCTGCT ACGTGGCGGT GCTCGACGAT
GACGACACCT GGGCCTCGTC GTTCCTGGCG CAGACGGTCG ATCATCTCGA GCGCACCGGG
GACCTGGCCG TTGCGACTCG ATCCGAGGTG ATCTACGAGC GCATCGACGG CGAGACGGTC
GTCACCGAGG GCCGTGAGCT GCTGGCATCG GACCGTAATC AGGTGACCCT GCTCGAGACG
ATCGTGCGTA ACTACACCCA CACCGGTTCG CTGGTCTACC GGCGCGACGT GCTGGACACC
ATCGGCCGAT ACGACGAGGC GCTGCCGGTA CTGGCCGACT GGGACTTCCT GCTCAGGCTG
CTCCGTCACG GCGAGGTCGG GTTCATCGAC GGCAACCCCC TCGCCTTCTG GCACCGCCGC
CCGGCGTCGG TCGGCGACGC CCGGAACAGC GTCGAGGGCG ACGAACACAC CCGCTGGGAT
GTCCTGGTCC GTGATCGCTA TCTGCGAGCC GACCTGGCCC GGCACGAGGG ATTGGGCTAC
CTGTTGTTCG TCAGCGAACT GCAGGACCGG GACGCGAGGA TCGCTCAGGC CCGCGGCGCG
CACATCGCCG GCGCCGTGCA CAAGATCGAC ACCGAACTGC TCTCGATGCG AGACACCCAG
ACCTCTCTCG CCGCGGCGGT TCACCACCTG CACGGCACCC AGGATGAGCT GCTGCGCCAG
CTGGTCGAGA TGAATCGCAA CCTGATCAGC CAGAACAACC GGATCGTCGC CCAGTTTGCC
CTGCTCGGCG AGAGGGTGGA GCGGCTCGAA TCGCTGCTCG ACCGCAGGCT TGCCGGTCAA
CTGCGTTGA
 
Protein sequence
MPAQHDQQTQ VAVVLRTRNR PLLLARALAS AAAQTYADYV VIVVNDTGDP GPVDDAVAQV 
ADRALGRFQV VHNTVSRGRE AALNIGLEAS SSCYVAVLDD DDTWASSFLA QTVDHLERTG
DLAVATRSEV IYERIDGETV VTEGRELLAS DRNQVTLLET IVRNYTHTGS LVYRRDVLDT
IGRYDEALPV LADWDFLLRL LRHGEVGFID GNPLAFWHRR PASVGDARNS VEGDEHTRWD
VLVRDRYLRA DLARHEGLGY LLFVSELQDR DARIAQARGA HIAGAVHKID TELLSMRDTQ
TSLAAAVHHL HGTQDELLRQ LVEMNRNLIS QNNRIVAQFA LLGERVERLE SLLDRRLAGQ
LR
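
The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch. It assumes the sequences are stored in space-separated blocks of ten characters, as shown here, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record.
first_line = "ATGCCCGCTC AGCACGATCA GCAGACGCAG GTCGCGGTTG TCCTGCGGAC GAGGAATCGT"
print(translate(first_line))  # MPAQHDQQTQVAVVLRTRNR
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of this record.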