Gene Namu_1834 details


Gene Information

Locus tag: Namu_1834
Symbol:
ID: 8447439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2013191
End bp: 2014966
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 65%
IMG OID: 645040963
Product: glycosyl transferase family 2
Protein accession: YP_003201213
Protein GI: 258652057
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
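
The coordinates and lengths in the Gene Information table can be checked against each other with simple arithmetic. The sketch below rests on two assumptions that this page does not state: the start and end positions are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2013191
end_bp = 2014966
gene_length_bp = 1776
protein_length_aa = 591

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

# Each comparison reports whether the listed value agrees with the derived one.
print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions, the span between the two coordinates, the gene length, and the protein length should all agree.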


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.000172396
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.276871
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTGA ACCAAGAGAA CGCTGATCTG ATCCCGATGG AACCCGACAC CGGCGTGCGC 
GAGTCCGATG ACCGCCGCCG CGGTGCCCGG GATTCACCCT CCCTGATGAT GCTGGTGCTC
GTCGCCACCA TCGGTGTGCT GCTGTACACC ACGTTCCTGT TCGACTTCTC GAACCGGGGC
AACTGGCTTC CGTACCTGAT GGTGCTGTCC GCCGAGTCGG TCATCATCTT CCAGGCCCTG
ATCGCGCTGT GGACCATCCT GTCCAGCGGT CACAACCCGC GGGGCTACCG CTTCCACAAC
GCGCAGAACC GGATCTACGG ACCGAATCAC AAGACTCTGG ATCCGGATCT GGACCTGACC
ACGCTGCCGA TGCACCTGCA CGATTCGCCG GTCGAGCTGG ACGTCTACAT CACCACCTAC
GGTGAGGACC TCGCGACCAT CCGCCGGACG ATTACCGCTG CGCTGGCCAT GCACGGCAAG
CACACCACCT ACGTGCTCGA TGACGGCAAG TCCGACGACG TCCGGGCGCT GGCCGCCGAG
CTGGGCGCCG AGTACATCGT CCGTGAGGGC AACGCCGGCG CGAAGGCCGG CAACATCAAC
AACGCGCTGA GCGTCACCAC CGGCGAGTTC TACGTCGTGC TGGACGCCGA TTTCGTGCCC
AAGGAAGACT TCCTGTACCA GACCGTGCCC TTCTTCGCGG AGACCAATGT GGCCTTCGTG
CAGACCCCGC AGGCCTACGG CAACCTGGAC AACCTGATCT CCCGTGGCGC CGGCTACATG
CAGTCCGTGT TCTACCGGTT CATCCAGCCG GGCAAGAACC GCTTCAACGC CGCGTTCTGC
GTGGGCACCA ACGTGATCTT CCGCCGCAAG GCGATCGAGT CCATCGGTGG CATGTACACC
GAGTCCAAGT CCGAGGACGT GTGGACCTCG CTCAAGCTGC ACGAGAACGG CTGGAAGTCG
GTCTACATCT CCACCGTGCT GGCCGTCGGC GACACCCCCG AGACCATCGA GGCCTACACC
AAGCAGCAGC AGCGCTGGGC GACCGGCGGG TTCGAGATCC TGCTCAAGGC CAACCCGTTC
TCCCGCAAGC GCAAGCTGAC CCTGGACCAG CGCCTGCAGT ACTTCGGCAC CGCCACGTTC
TACCTGATCG GCATCGCCCC CGGCGTCCTG TTGCTGGTGC CGCCGCTGCA GATCTACTTC
GGTCTGGCCC CGATCAACAC CGGCGTCAGC TTCGGCCAGT GGCTGCTGTA CTACGCGGGC
TTCTACTTCA TGCAGATCAT CGTCGCGCTG TACACCATCG GGTCCTTCCG CTGGGAAACC
CTGATGCTGG CCACCGCCTC GTTCCCGATC TACGGCAAGG CCCTGGTCAA CGCGGTGTTC
AAGAAGGACA CCAAGTGGCA CGTGACCGGT GCCCAGCGGC GCAAGGCCTC CCCGTTCAAC
TTCATCACCC AGCAGCTGAT GGCCTTCGTC TTCCTGGCCA TCACCTCCGT GGTCGGCATC
TGGCAGGCCA TGACGGTCAG CGCCTTCACC CTGGCGCTGT TCTGGAACCT GCTGAACACC
TTCATCCTCG GCGCGTTCGT GATCACCGCG TTCCGCGAGA GCCGGCACAA CCGCCGCGAG
GAGAAGGGCC TGCCGCCCAA GGGCAGCGCG AAGGTGGCCG CCGAGGCCGC TGCGGCCCGG
GCCCTGGCCC AGGACAAGAT CGGTTCCGGC CTGTCTGAGC GGACCTACGA GGGACTGCCC
CCGGTGCGGA TCGACACCGC CGCCGGCGCC CGCTGA
 
Protein sequence
MSLNQENADL IPMEPDTGVR ESDDRRRGAR DSPSLMMLVL VATIGVLLYT TFLFDFSNRG 
NWLPYLMVLS AESVIIFQAL IALWTILSSG HNPRGYRFHN AQNRIYGPNH KTLDPDLDLT
TLPMHLHDSP VELDVYITTY GEDLATIRRT ITAALAMHGK HTTYVLDDGK SDDVRALAAE
LGAEYIVREG NAGAKAGNIN NALSVTTGEF YVVLDADFVP KEDFLYQTVP FFAETNVAFV
QTPQAYGNLD NLISRGAGYM QSVFYRFIQP GKNRFNAAFC VGTNVIFRRK AIESIGGMYT
ESKSEDVWTS LKLHENGWKS VYISTVLAVG DTPETIEAYT KQQQRWATGG FEILLKANPF
SRKRKLTLDQ RLQYFGTATF YLIGIAPGVL LLVPPLQIYF GLAPINTGVS FGQWLLYYAG
FYFMQIIVAL YTIGSFRWET LMLATASFPI YGKALVNAVF KKDTKWHVTG AQRRKASPFN
FITQQLMAFV FLAITSVVGI WQAMTVSAFT LALFWNLLNT FILGAFVITA FRESRHNRRE
EKGLPPKGSA KVAAEAAAAR ALAQDKIGSG LSERTYEGLP PVRIDTAAGA R
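
The blocked gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own procedure: it strips the block spacing, computes a GC fraction, and translates DNA using the codon assignments shared by NCBI translation tables 1 and 11 (the two tables differ in their permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the first two protein blocks from this page.
first_gene_line = clean(
    "ATGTCCCTGA ACCAAGAGAA CGCTGATCTG ATCCCGATGG AACCCGACAC CGGCGTGCGC"
)
first_protein_blocks = clean("MSLNQENADL IPMEPDTGVR")

print("translation matches:", translate(first_gene_line) == first_protein_blocks)
print("GC fraction of first line:", round(gc_fraction(first_gene_line), 2))
```

Passing the full gene sequence to the same helpers gives a value to compare with the listed GC content of 65% and a translation to compare with the listed protein sequence, with the stop codon appearing as a trailing `*`.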