Gene Namu_3957 details


Gene Information

Locus tag: Namu_3957
Symbol:
ID: 8449576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4366193
End bp: 4368049
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 70%
IMG OID: 645043002
Product: von Willebrand factor type A
Protein accession: YP_003203238
Protein GI: 258654082
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
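
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4366193
END_BP = 4368049


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1857, matching "Gene Length"
    print(protein_length(length))  # 618, matching "Protein Length"
```

Under these assumptions, 1857 bp corresponds to 619 codons, of which 618 encode residues and one is the stop codon.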


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.136708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTATC CGCCGGGTCA GGCCGGGCCG CGCAAGCGCA GCAACGTCGT GCCGATCGTC 
GCCGCGGTGA TCGCCGGCGT GCTGCTGATC GTCGGCATCC GCTGGTTCAC CACCCGGGGC
GACGATTCGA CCTCCGGCTC CGGTCCGACC ACCACCGCCA CCGGCGCGCC ACCGCCACGC
GACGGCTGCA CCCGGGTGAC CGTCGCCGCC TCCAGCGAGA AGGCGGCGCT GCTGCAGCAG
ATGGCGCAGA CCTACCACTC GTCCGGGCGC ACCGTCGACG GCAAATGTTT CGACGTGCAG
GTGAACTCGG TGGCCTCCGG CACGGCCGAG GCCAACCTCG CGCAGGGCTG GGACGAGGCA
CTGGACGGCC CCGCACCGGA CGCCTGGACG CCCGCCGCCT CCACCTGGGT CAGCCTGCTG
GCCAGCGATC TGACCGCCAA GGACCGGCCC ACCATCCTTC CGGCCGAGGC GGCGAAGTCG
ATCGTCTCGA CGCCGCTGGT GCTGGCCATG CCCGAGCCGA TGGCCAGGGC ACTAGGCTGG
CCGGACGCGC AGATCGGCTG GTCGGACGTG CTGGCGCTGG CCAAGGACCC GCAAGGGTGG
GCGGCCAAGG GCCACCCGGA ATGGGGCAGG TTCACCCTCG GCAAGACGAA CCCGACCGTG
TCCACCTCCG GGCTGGCCGC CACCATCGGC ACGCTGGTCG CGGCCACCGG TACGTCGTCG
GACCTGACCG AGGCGGCCCT GCAGCGGCCG GAGGTGCAGC AGTACCTCAA GGACGTCGAG
ACCGCCGTCA TCCACTACGG CGACACCACG CTGACCTACC TGACCAACCT GCAGCACGCC
GACGACTCCG GCGCGGCCCT GGGCTACGTG TCCGCGGTGG CCGTGGAGGA GAAGAGCGTT
CTGGACTACA ACGCCGGCAA CCCGAGCGGG AACCCGGCCA CCCTGGGCGA CCACGCACCG
CCGAAGGTGC CGCTGGTCGC GGTGTACCCG AAGGAGGGCA CGCTCTACAG CGACAGCCCG
TTCGTCATCC TCGACGCCCC GTGGTCGACC GCCGACAAGC AGGCCGGCGC GCAGGACTTC
ATGGAGTTCC TGCTGCTGCC CGAGCAGCAG AAGGTGTTCA CCGAGGCGAA CTTCCGCACC
GCGGACCACC AGCCCGGCGA ACCGATCACG TCGAGTCCGT ACCTGATCGC GGACGGCGTG
ACGATCGCGC TCAACCCGCC GGGCCCGTCG GTCCTGCGCG ACGTCCGAGC CCTCTGGACG
CAGGTCCGCA AGCCCGCCCG GGTCCTGGTG GTGATGGACG TGTCCGGGTC GATGGCCAGC
GAGTCGGGGT ACGGCAGCGA GTCCAAGCTC GACCTGGCCA AGAAGGCCGC GACATCGGCG
TTGGGTCAGC TGACCGACAC CGATCAGATG GGGCTGTGGG CGTTCACCAC CGACCTGCCC
ACCCCGGACA CGATCACCGC CGACCTGGTC GGTGTCGGGC CGCTGGCGCA GACCCGGCAG
CCGATCATCG ACGCGATCTC CAGCCTGACC CCGCTGAACG GCACCCCGCT GTACGCGGCG
ACGCGGGAGG CGGCGAAGGC GATGAACGCG CAGAAGGATC CGAACTCGAT CAACGCGGTG
GTCGTGCTGA CCGACGGTCG CAATGAGTAC ACCGACAACG ATCTGGACGG TCTGCTGCGC
GAGCTGAACG CGAGCGCCGA GGAGGACGGG GTGCGGGTGT TCACCATCGC CTACGGTCCG
GATGCGGACC TGGCCACCCT GCAGGAGATC TCCGAGGCGT CCCGGGCCGC CGCCTACGAC
GCGCGGAACC CGACGAGCAT CGACAAGGTG TTCTCCGACG TGCTGTCCAA CTTCTGA
 
Protein sequence
MSYPPGQAGP RKRSNVVPIV AAVIAGVLLI VGIRWFTTRG DDSTSGSGPT TTATGAPPPR 
DGCTRVTVAA SSEKAALLQQ MAQTYHSSGR TVDGKCFDVQ VNSVASGTAE ANLAQGWDEA
LDGPAPDAWT PAASTWVSLL ASDLTAKDRP TILPAEAAKS IVSTPLVLAM PEPMARALGW
PDAQIGWSDV LALAKDPQGW AAKGHPEWGR FTLGKTNPTV STSGLAATIG TLVAATGTSS
DLTEAALQRP EVQQYLKDVE TAVIHYGDTT LTYLTNLQHA DDSGAALGYV SAVAVEEKSV
LDYNAGNPSG NPATLGDHAP PKVPLVAVYP KEGTLYSDSP FVILDAPWST ADKQAGAQDF
MEFLLLPEQQ KVFTEANFRT ADHQPGEPIT SSPYLIADGV TIALNPPGPS VLRDVRALWT
QVRKPARVLV VMDVSGSMAS ESGYGSESKL DLAKKAATSA LGQLTDTDQM GLWAFTTDLP
TPDTITADLV GVGPLAQTRQ PIIDAISSLT PLNGTPLYAA TREAAKAMNA QKDPNSINAV
VVLTDGRNEY TDNDLDGLLR ELNASAEEDG VRVFTIAYGP DADLATLQEI SEASRAAAYD
ARNPTSIDKV FSDVLSNF
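
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own tooling: it assumes the gene sequence is given 5' to 3' on the coding strand, uses the amino acid assignments of translation table 11 (codons ordered with bases T, C, A, G), and reads the initiator codon (GTG here) as methionine. The helper names `clean`, `translate_cds` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids of translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; the first codon is read as Met (assumption)."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases long")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    residues[0] = "M"  # initiator codon, e.g. GTG, is translated as methionine
    protein = "".join(residues)
    # Drop the trailing stop codon symbol if the CDS includes it.
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGAGCTATC CGCCGGGTCA GGCCGGGCCG"))  # MSYPPGQAGP
```

Pasting the full gene sequence into `translate_cds` should allow a comparison against the listed protein sequence, and `gc_percent` gives a value to compare with the reported GC content.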