Gene Namu_0550 details


Gene Information

Locus tag: Namu_0550
Symbol:
ID: 8446134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 609975
End bp: 611918
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 73%
IMG OID: 645039684
Product: von Willebrand factor type A
Protein accession: YP_003199955
Protein GI: 258650799
COG category: [R] General function prediction only
COG ID: [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
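
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 609975
end_bp = 611918

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1944
print(protein_length_aa)  # 647
```

Under these assumptions the result matches the reported Gene Length (1944 bp) and Protein Length (647 aa).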


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC GTCGCCGGTA TTCCTACCGG GCCTACGACG GCGGCCCGGA CCCGCTCGCG 
CCGCCGTTCG ACCTGCGCGA GGCAATCGAC CGGATCGGCG CCGACGTGCT GGACGGCAGC
TCTCCGCGGC AGGCCCTGCA GGAACTGCTG CGCCGCGGGC TGGGCGAACG GCGCGGGCTG
GACGAGCTGA CCCGCCAGTT GTGGCAGCGG CGGCGGGAAC TGCAACGCAA CAACCGGCTC
GACGGCACCC TGCAGCAGGT CCGGGAACTG CTCGACCGGG CGCTGACCGC CGAGCGAAAG
GCGCTGGCCC GCCAGGATTC GGACGACGCG CGGTTCGGCG AGCTGCAGCT GGACGCGTTG
CCCACCGACG CCGCCGGCGC CGTCCGCGAG CTGGAGAATT ACGACTGGCA GTCCCCCGAC
GGGCGGGCCG CCTACGAGCA GATCCGCGAC CTGCTGGGGC GGGAGATGCT CGACCAGCGG
TTCGCCGGCA TGAAACAGGC GCTGGAGAAC GCGACATCGC AGGACGTCCA AGCCATCCGG
GACATGCTGG CCGACCTCAA CGAGTTGCTG TCGGCGCACG CCCGGCTGGA GGACACCACC
GAGCGGTTCG GCGAGTTCAT GAACCGGCAC GGCCACTTCT TCCCCGAGCA ACCGCGCACC
ACCGACGAGC TGATCGACGT GCTCGCGGCC AGATCCGCGG CGGCGCAACG GATGATGAAC
TCACTGTCGG CCGACCAGCG GGCCGAGCTG GCCGCGCTGA CCGAGCAGGC GTTCGGCGAT
CCCCGGCTGG CGCAGTCGCT GGCCCAGCTG GACGCGATGC TGCAGCAGTT ACGGCCCGGC
CAGGACTGGG ACGGCTCCGG CAGGTTCCGC GGCGACAACC CGATGGGCAT GGGCGAGGCC
ACCCGAGCCC TGGAGGAGCT GGGCCGGCTG GAGTCGCTGG GCGAGCAACT CGCCCAGGGC
TACCCGGGCG CCAGCCTGGA CGACATCGAC CTGGACCTGC TGCAGGACGT GCTGGGCGAG
CAGGCCCGGG TGGACGCCCG GGCGCTGGCC GAGCTGGAAC GTGAACTGCG CGAACAGGGC
CTGCTCGAGC GGGCCGCCGA CGGCTCCCTG CAGCTGTCCC CCAAGGCACT GCGCCGGTTG
GGCCAGACCG CGTTGCGCGA CATCGCGGAC CAGGTCGCCG GCCGCTCCGG GGAGCGGGAG
ACCCGTCGGT CCGGGCCGGC CGGCGAGGCC TCCGGGGCGA CCCGGCCGTG GGCCTTCGGC
GACACCGAAC CGTGGCACGT GCCGCGGACC CTGCTCAACG CCCAGGTGCG GCGGGCCGGC
GGCGATCCCC GAGTGCTGGA CGTCACCGAT GTCGAGGTCG TGGAGACCGA GCGACGGGCC
CGCGCCGCGG TGGCGCTGTG CGTGGATACC TCCTGGTCGA TGGTGCAGGA GGGCCGGTGG
GTGCCGATGA AACGCACCGC CCTGGCCCTG CACCAGCTGA TCTCCACCCG GTTCCGCGGG
GACGACCTGG CCCTGATCAC CTTCGGCCGG CACGCCGAGA AGGTCGAGCT GGGGCAGCTG
GTCGGGCTGG AGGGTGCCTA CGTGCAGGGC ACCAACCTGC ACCACGCGCT GCTGCTGGCC
GGCGCGCACC TGCGGCGGCA CCCCGATGCC ACCCCGGTGG TCCTGGTGGT GACCGACGGC
GAACCCACCG CGCACCTGGA ACCCGATGGC TCGCCGCACT TCTCGTACCC GCCGGATCCG
GAGACGGTGC ATGCGACGGT GGGCGAACTG GACCGGCTGA CCGGGCTGCG CGCCGCCGTC
ACCTTCTTCA TCCTCGGTGA CGACCCGCGG TTGGCCCTGT TCACCGACAA ACTGGCTCGC
CGCTGCGGCG GCCGGGTGGT CGCCCCCGAC CTGGACGGGC TCGGGGCCTC GGTGGTCGCC
GACTACTTGC GGCATCGGCG ATGA
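
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before any further processing. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a rough check for a complete coding sequence. The function names are hypothetical, and the start and stop codon sets are assumptions (common start codons and the standard stop codons used with translation table 11); none of this is part of the record itself.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Assumption: common bacterial start codons and the three standard stop codons.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole number of codons, a start codon, and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The final line of the gene sequence above, used as a small example.
last_line = clean_sequence("GACTACTTGC GGCATCGGCG ATGA")
print(len(last_line))   # 24
print(last_line[-3:])   # TGA
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the reported GC content of 73%.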
 
Protein sequence
MTARRRYSYR AYDGGPDPLA PPFDLREAID RIGADVLDGS SPRQALQELL RRGLGERRGL 
DELTRQLWQR RRELQRNNRL DGTLQQVREL LDRALTAERK ALARQDSDDA RFGELQLDAL
PTDAAGAVRE LENYDWQSPD GRAAYEQIRD LLGREMLDQR FAGMKQALEN ATSQDVQAIR
DMLADLNELL SAHARLEDTT ERFGEFMNRH GHFFPEQPRT TDELIDVLAA RSAAAQRMMN
SLSADQRAEL AALTEQAFGD PRLAQSLAQL DAMLQQLRPG QDWDGSGRFR GDNPMGMGEA
TRALEELGRL ESLGEQLAQG YPGASLDDID LDLLQDVLGE QARVDARALA ELERELREQG
LLERAADGSL QLSPKALRRL GQTALRDIAD QVAGRSGERE TRRSGPAGEA SGATRPWAFG
DTEPWHVPRT LLNAQVRRAG GDPRVLDVTD VEVVETERRA RAAVALCVDT SWSMVQEGRW
VPMKRTALAL HQLISTRFRG DDLALITFGR HAEKVELGQL VGLEGAYVQG TNLHHALLLA
GAHLRRHPDA TPVVLVVTDG EPTAHLEPDG SPHFSYPPDP ETVHATVGEL DRLTGLRAAV
TFFILGDDPR LALFTDKLAR RCGGRVVAPD LDGLGASVVA DYLRHRR