Gene Namu_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4203 
Symbol 
ID8449829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4644356 
End bp4646881 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content67% 
IMG OID645043252 
Productglycosyl transferase family 2 
Protein accessionYP_003203481 
Protein GI258654325 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.138699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAAGC GCACATCCGA GATGGTGTCC ATCGTCCTGG TCAACTTCCG CGGCGCCGAT 
GACACGATCA CCTGCATCAG GTCGCTGCGC AAGGTCGACT GGCCGGCCGA GAAGCTGGAA
ATCGTCTGCG TGGAGAACGG GTCGGGCGAC GACAGCGCCG CGCGAATCGC CGCGGCCGAT
CCTGGCGTGA CGCTGGTGAA GTCCGACGAC AACCTTGGCT TCGCCGGAGG CTGCAACCTG
GGCGTCCGGC ATGCGCGGGG CGAGTACGTG GCCTTCTTGA ACAACGACGC GCGGCCCGAC
CCGGGCTGGG TGCGGGCGGC GGTCGACGCG TTCAAGACCT CGCCCAACGT GGGTTCGGTC
GCCTCCAAGG TGCTCGACTG GGACGGGCAG AAGATCGACT TCGTCGAGGC GGCGATCACC
TGGTTCGGCA TGGGGTACAA GCCGTTCTGC GAGTCCCCGG ACACCGGCGC CTTCGACGAG
CCGCGGGACG TCCTGTTCGC TACGGGCGCG GCGATGTTCG TGCGGGCCGA CGTGTTCGAC
AGCGTCGGCG GCTTCGACGA GCGCTACTTC ATGTTCTATG AGGACGTGGA TCTGGGCTGG
CGCCTGAACC TGCTGGGCTG GAAGGTCCGC TACGAGCCGC GCTCGCTGGC TTTCCACAAG
CATCACGCAT CAATGAACAA GTTCGGTGCG TTCCGGGAGA CCTACCTGCT GGAGCGCAAC
GCGCTGTACA CGATGTACAA GAATCTGGAC GACCGTTCCC TGGCTCAGTT CCTGCCCGGT
GCGCTGCTGC TGGCCGTTCG TCGCGCGGTG GCCCGGGGCG AATTGGACAG CACCGAGCTG
GACATTCGGC GACCGGGAGA TGACGCCACG CCGGATCGCC CGGTGGCCAA GCAAGCCATG
GCCGGGATCT ACGCCATCGA TCAGCTGGTG GAGAACATCA CCTCGTTCAC GGAGACCCGG
CAGCTGCTCC AGCAGCGGCG CCGGGTCGGC GACTCCGAGC TGCGGCCGTT GTTCGGCAAG
CTGATGGAGC CGGCCTACCC GCTGCCGACT TACCTGGAGG CGCACGAGGA ACTGGTGTCC
GCGCTGGGCA TCGACGCGGC CGGACGCAAG AAGCGTGTCG TCATCATCAC CGGTGAGCCG
GTCTCCGCGG TGATGGCCGG GCCGGCGATC CGTTCCTGGA ACATGGCGCA GTATCTGAGC
CGTGAGCACG AGGTGCGCCT GTTGACGTTC GGTACCGCCG GGGTTCGGCC GGATAAGTTC
GAGGTTCTCT CGGTCTCGCC GCGGGACGCG CACGCGGCCG ACGTGCACAT CGACTGGGCC
GACGTGATCA TCTTTCAGGG ACACGCGATG GCCGTGTTCC CCGCTCTCTA CGAGACCGAC
AAGGTCGTGG TCGTCGACCT GTACGACCCA ATGCATCTGG AGCAGTTGGA GCAGGCGAAG
GAGAAGGGGC CCAAGGCCTG GGCCTTCGAG GTGAACTCGG CCACCGAGGT CCTCAATCAG
CAGTTGGCCC GCGGCGACTT CTTTTTGTGC GCGAGTGAAC GTCAGCGTCA CTTCTGGCTC
GGCCAGTTGG CCGGTGAGGG ACGCCTCAAC CCCCTCACCT ATGCCCAGGA CAATTCGCTG
GGCAGTCTGC TGGCCCTGGT CCCGTTCGGA CTTCCGGCCG CGGAACCGGT TCGGACTGCT
CCGGCCCTGC GGGGGGTCGT CGACGGCATC GGCGCCGACG ACAAGATCGT GATCTGGGGA
GGCGGCATCT ACAACTGGTT CGACCCGCTG AGTCTGATCC AGGCGATTTC CGGGCTCGCT
CGAACCCATC AGGACATCCG GCTGTTCTTT CTGGGCATGC AGCATCCCAA CCCGGCGGTC
CCGGAGATGC AGATGGCGGT GCGCGCCCGG CAGCTGTCGG AGGAGCTCGG CCTCACCGGA
CGCCATGTGT TCTTCAACGA GGAGTGGGTG GCCTACAACG CCCGGCAGAA CTACCTGCTG
GATGCCGATG TCGGGGTCAG CACGCATTTC GAACACATCG AAACCACCTT CTCTTTCCGT
ACCAGGATCC TGGACTACCT GTGGACCAGG CTGCCGATCG TGACGACTCG CGGCGATGGG
TTCGGTGATC TGGTCGCGGC CGAGGGTCTG GGCGTCGCCG TGCGCGAGAA CGACCCGCAG
GCCCTGGCCG ACGCCCTCGA AATCATGCTG TACGACGACG TCGAACGAGG CCGGGTGATC
CGCAATCTGG ATCGGGTCCG GGCCGAGTTC ACCTGGGACA AAACGCTGGC CCCGTTGCTG
GAGTTCTGCC GCGATCCGCA CCCGGCCGCC GACCGGGTCT TGCCGGAAAC GATGACGCCG
GTGCGAACGG CCGGGCCGAC ACGGGTTCTG GCCGAGCGAG TCCGCTCCGA CCTGGCGATC
GCCGGGCGAC ACCTGCGCAG CGGCGGCGTC CGCGAGATGG TGGGCGCGGC GGCCGGCCGC
GTGAAGCGGC AGCTGACCGC CCGCAAACGT AAAGCCGAGC GGGCACGGGC GGCGAGGGCG
CAATGA
 
Protein sequence
MSKRTSEMVS IVLVNFRGAD DTITCIRSLR KVDWPAEKLE IVCVENGSGD DSAARIAAAD 
PGVTLVKSDD NLGFAGGCNL GVRHARGEYV AFLNNDARPD PGWVRAAVDA FKTSPNVGSV
ASKVLDWDGQ KIDFVEAAIT WFGMGYKPFC ESPDTGAFDE PRDVLFATGA AMFVRADVFD
SVGGFDERYF MFYEDVDLGW RLNLLGWKVR YEPRSLAFHK HHASMNKFGA FRETYLLERN
ALYTMYKNLD DRSLAQFLPG ALLLAVRRAV ARGELDSTEL DIRRPGDDAT PDRPVAKQAM
AGIYAIDQLV ENITSFTETR QLLQQRRRVG DSELRPLFGK LMEPAYPLPT YLEAHEELVS
ALGIDAAGRK KRVVIITGEP VSAVMAGPAI RSWNMAQYLS REHEVRLLTF GTAGVRPDKF
EVLSVSPRDA HAADVHIDWA DVIIFQGHAM AVFPALYETD KVVVVDLYDP MHLEQLEQAK
EKGPKAWAFE VNSATEVLNQ QLARGDFFLC ASERQRHFWL GQLAGEGRLN PLTYAQDNSL
GSLLALVPFG LPAAEPVRTA PALRGVVDGI GADDKIVIWG GGIYNWFDPL SLIQAISGLA
RTHQDIRLFF LGMQHPNPAV PEMQMAVRAR QLSEELGLTG RHVFFNEEWV AYNARQNYLL
DADVGVSTHF EHIETTFSFR TRILDYLWTR LPIVTTRGDG FGDLVAAEGL GVAVRENDPQ
ALADALEIML YDDVERGRVI RNLDRVRAEF TWDKTLAPLL EFCRDPHPAA DRVLPETMTP
VRTAGPTRVL AERVRSDLAI AGRHLRSGGV REMVGAAAGR VKRQLTARKR KAERARAARA
Q