Gene Namu_1270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1270 
Symbol 
ID8446866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1393796 
End bp1395577 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content76% 
IMG OID645040404 
Productvon Willebrand factor type A 
Protein accessionYP_003200663 
Protein GI258651507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.432707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC AGTCGCCCCC GGCCGCGCAC GGCCGGCACC GCAGGTCCCC GGCCCGCCGC 
CGCACCCTGA CCGTGGTGGC CGTGGTGGTG GCCCTGGTGG CCGCGGCCGG CGTCATCACC
TGGCTGGTGC GGTCCCGGTC GTCGACCGAT CCCGCCGCCG GCCCGGCGGC GGTGACAGTG
ACGGCGGCGA GCCCGACCGG CAGCCCGTTC GACGGCGCGG CCGCCAGTGC CCCCGTGAGT
GCGGGCGCGA GGTGCCCGGG TGAGCCGCTG ACCGTCGCGG TCACGCCCGA TCTCGCCCCC
ACACTGACCG CCTTCGCCGA GCGGCAGGAC CTGACCGTGG CCGGATGCCC GGTGCGGATC
GACGCCGTCG ACCCGGCCCA GGTCGTCGAC GGATCGGCGA CCGCCGACGT GTGGATCCCC
GACTCGTCCA GCTGGCTGCC CCGGGCGACG GCCGCCGGGC GCACCGTCGG GCCGGACGCC
CCGTCCATCG CCACCAGTCC CGTCGTGTTC GCCCTGTCCG GTCAGGCCCA GCAACAGCTG
GCCGCGGCCG GTGCATCGAC CGATGTCGCC GGCCTGCTGG CCACCCGCAA GACGGCCGCG
CCGATCCGGG TGGGCCTGCC CGACCCGCAA CGGTCGGCGG CCGCGGTGGC GGCCACCCTG
TCCGCCCGGG CCGCGGTCAG CGGCGCCACC GACGCCCGGC CCGCCCTGAC CTGGGCGGTC
CGCTCCAGCC CGGCCGACCT GCCGGTCGAC GACGCTCAGC TGCTGGCCCG CCTAACGTCC
GATCCGGGCA CCGCGGTGCC GGTCACCGAG CAGTCGCTGC TCGCCTGGGA TCAGGACCAT
CCGGACTCAC CCGCCCGGGC GCTCTACCCC GGACCCGGCG GGTTCGCCAT GGACTTTCCC
GTCGTCGCCG TCGGCGGCGA CCCGGCCGCC ACGGCCGCCG CCCGCGAGCT GGCCACCGCC
TTGACCACCG AACCGGCCCG CACCGCCCTG CTGGCGGCCG GTTTCCGCGC CCCGGATCAG
ACTCCGGGGC CGGCGATCAG CGCCGCCGGG GCCGCCAGCG GCATCGACCC GGCGTACCGG
GAGACGTCGG ACCCGCCCAC CCCGCAGGCC GTCGACGACG CCATCCGCAG CGTCCAGGTG
ACCAACGAAG GCACCCGGAT GCTGGCCGTC ATGGACATCT CCGGGTCGAT GCTGGCCCAG
GTGCCGGGCA CCAACGGCGC CGACCGGATC GACCTGGCCA AGGACGCCGC CGCTCGCGGC
CTGGGCCTGT ACCGGGCGGA CAGCGACATC GGCCTGTGGG AGTTCTCCAC CCGGCTCAGC
CCGACCAGCG ACCACCGCGA GCTCATCCCG ATCAGCTCGC TCGGGCCGGA CGGGCAGGGC
AGCACCGGTG CCGCCCGGCT GGCCGCCGCG CTGAACGGGC TGCAGGCCAT CCCCGACGGC
GGTACCGGCC TGTACGACAC CGTGCTGGAT GCGACCCGGA CCGTGCGGGC CGGCTACGAC
CCCGACCGGG TCAACGTGGT GCTGCTGCTG ACCGACGGGA TGAACGACGA CGTCAACAGC
ATCACCATGG ACCAGTTGCT CAGCACCCTG GCCGCCGAGC AGGACCCGGC CCGGCCGGTA
CCGGTGATCT CGATCGCCTT CGGCCCGGAC AGCGACGTGG CCGCGCTCCA GCAGATCAGC
CGGGCCACCG GTGGGGCCAC CTACCTGTCG CAGGACCCCC GGCAGATCGG CGAGATCTTC
CTGGACGCGG TGGGCCAGCG TCTGTGCCGG CCCAGCTGCT GA
 
Protein sequence
MTDQSPPAAH GRHRRSPARR RTLTVVAVVV ALVAAAGVIT WLVRSRSSTD PAAGPAAVTV 
TAASPTGSPF DGAAASAPVS AGARCPGEPL TVAVTPDLAP TLTAFAERQD LTVAGCPVRI
DAVDPAQVVD GSATADVWIP DSSSWLPRAT AAGRTVGPDA PSIATSPVVF ALSGQAQQQL
AAAGASTDVA GLLATRKTAA PIRVGLPDPQ RSAAAVAATL SARAAVSGAT DARPALTWAV
RSSPADLPVD DAQLLARLTS DPGTAVPVTE QSLLAWDQDH PDSPARALYP GPGGFAMDFP
VVAVGGDPAA TAAARELATA LTTEPARTAL LAAGFRAPDQ TPGPAISAAG AASGIDPAYR
ETSDPPTPQA VDDAIRSVQV TNEGTRMLAV MDISGSMLAQ VPGTNGADRI DLAKDAAARG
LGLYRADSDI GLWEFSTRLS PTSDHRELIP ISSLGPDGQG STGAARLAAA LNGLQAIPDG
GTGLYDTVLD ATRTVRAGYD PDRVNVVLLL TDGMNDDVNS ITMDQLLSTL AAEQDPARPV
PVISIAFGPD SDVAALQQIS RATGGATYLS QDPRQIGEIF LDAVGQRLCR PSC