Gene Namu_4187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4187 
Symbol 
ID8449813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4623543 
End bp4624703 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID645043236 
Productglycosyl transferase family 2 
Protein accessionYP_003203465 
Protein GI258654309 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACCG TCGCACCGGC CTGCGAGCTG ACTGTCTTGC TGCCGTGTCT CAACGAGTCC 
GAGACGCTGG CCACGTGTAT CCGCAAGGCG CGGGACTCCA TGGAGCGACT CGGCATTGAT
GGGGAGGTGG TGATCGCGGA CAACGGTTCC GACGACGGAT CGCCGCAGAT CGCCGAGCGC
GAAGGGGCCC GGGTGGTCGC GGTTCGGGCC CGCGGGTACG GCGCGGCCCT CGCCGGGGGG
ATCCAGGCGG CCCGGGGTCG CTGGGTCCTG ATGGCCGACG CCGACGACAG CTACGCTCTG
GACGACATCG AGCCGTTCGT CGCGGCGTTG CGCGGCGGCG CCGACTTGGT CATGGGCAAC
CGCTTCGCCG GTGGGATCGA GCGCGGAGCC ATGCCCGCGC TGCACCGTTA CGTCGGCAAT
CCGGTGCTGT CCAAGCTGGG CCGCCTGTTC TTCGGCGTCC CGGTCGGTGA CTTCCACTGC
GGGATCAGGG CCTTTGACCG GGACAAGGTC AGCGCCCTGG GAATGCGCAC GCCTGGGATG
GAGTTCGCCA GCGAGATGGT CGTCCGGTCC TCCCTGGCGG GTTTGCGGAT CGAAGAGGTG
CCGACCACGC TGCGCCCGGA CGGGCGCAGC GGCTCGCCCC ACCTGCGAAC CTGGCGTGAT
GGCTGGCGGC ATCTGAGTTT TCTGTTGGCC CTCACGCCGC GGTGGCTGAT GCTCTATCCG
GCCCTCGTGC TGTTTTCGGT CGGGGGCCTG GGCTTGCTGG CCCTGGCCTT GGGTCCGAAG
CAGGTCGGCA ACGTGGTGTT CAGTGTGCAG ACCATGTTGG CCTGCGCGAC CGGGGTGATC
GCCGGGCTGC AGTTGCTTGG GTTGGCCGTC GTGACGCGGT CCTATGCGGC GCGCCTAGGC
CTGCTGCCGC CCAACGACCG CCTGGAGCGG ATGCTTGAGC GCGTCACCCT CGATCGCGGC
GTGGTCGTCG GCGGCGTGTT GCTGACCCTG GGGGTGGTGG CCTTCGTGGT TGCACTGCTG
GTGTGGGGCT CGCACGGATT CGGGGCGCTC GACCCGATGT CCACCATGCG GCTGCCGATT
CTGGGAATGG TGCTGGTGCT CGGCGGGCTG GAACTGATCA TGGTCAGTTT CACCGTCAGC
CTGTCCCGCC GTTCCGGTTG A
 
Protein sequence
MATVAPACEL TVLLPCLNES ETLATCIRKA RDSMERLGID GEVVIADNGS DDGSPQIAER 
EGARVVAVRA RGYGAALAGG IQAARGRWVL MADADDSYAL DDIEPFVAAL RGGADLVMGN
RFAGGIERGA MPALHRYVGN PVLSKLGRLF FGVPVGDFHC GIRAFDRDKV SALGMRTPGM
EFASEMVVRS SLAGLRIEEV PTTLRPDGRS GSPHLRTWRD GWRHLSFLLA LTPRWLMLYP
ALVLFSVGGL GLLALALGPK QVGNVVFSVQ TMLACATGVI AGLQLLGLAV VTRSYAARLG
LLPPNDRLER MLERVTLDRG VVVGGVLLTL GVVAFVVALL VWGSHGFGAL DPMSTMRLPI
LGMVLVLGGL ELIMVSFTVS LSRRSG