Gene Namu_3302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3302 
Symbol 
ID8448917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3629267 
End bp3630967 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content71% 
IMG OID645042381 
Productalpha amylase catalytic region 
Protein accessionYP_003202621 
Protein GI258653465 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.15105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0149256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCG AGGAAACCGT GGCCTTCAGC GACATCACGG CCGACCTGGA GTCGACCCCC 
GGCCAGGACT GGCGCCGCAC CGCGGTCGTC TACCAGATCT ACCCGCGCTC GTTCGCCGAC
TCCAACGGTG ACGGGATCGG CGACCTGCCC GGGATCAACC ATCGCCTGCC GGCCCTGGCC
GAGCTGGGCG TGGACGCCAT CTGGCTGTCC CCCTTCTACA AGTCCCCGCA GGCCGACGCC
GGGTACGACG TGGCCGACTA CCGCGACGTC GACCCCGTTT TCGGCACCCT GGCGGACTTC
GACGCGATGC TCGAGCGGGC CCACGGGCTG GGTCTGAAGG TGATCGTCGA CCTGGTGCCC
AACCATTCCT CCGACGAGCA CGTCTGGTTC CAGCAGGCCC TGGCCGCGGC CCCCGGCTCC
CCCGAGCGGG ACCGCTACGT CTTCCGCGAG GGACGCGGCG AGCACGGCGA GCTGCCGCCG
AACAACTGGG AATCCTGCTT CCGCGGGCCG GCCTGGACCC GGACCACCAA CCCCGACGGC
ACCCCCGGCC AGTGGTACCT GCACCTGTTC GACACCAAGC AGCCCGACTG GAACTGGGAG
AACCCGCAGG TTCGGGCCGA GTTCCTGGAC GTGTTGCGGT TCTGGCTGGA CCGCGGCGTG
GACGGCTTCC GGGTGGACGT GGCCCACGCG CTGATCAAGG CCCCGGGCCT GCCCGACATG
AAGGCCTCCG ACGAGGTCGT CGAGGACAGT GAGGGCTACT TCCACACCGG CCCCATGTGG
GACCAGGACC GGGTGCACGA GGTCTACCGG GAGTGGCGGG CCCTGCTGGA CACCTACTCC
CCCGACCGGA TCCTGTGCGC CGAGGCGTGG GTGCCCTCGC TGTCCCGGCT GGCCCGCTAC
GTCCGCGAGG ACGAGATGCA CCAGGCGTTC AACTTCGACT ACCTGGAGAG CGAGTGGGAC
GCCACCCACC TGCGCTCGGT GATCGATTCC TCGGTCGCCG CCAACGACGA GGTCGGCGCG
CCCACCACCT GGGTGCTGTC CAACCACGAT GTGGTCCGCC ATGTCTCGCG GCTGGGCCTG
CCGGCCGGTC CGCGGCCCAA CGGCATCCGG GCCCAGGATC AGCAACCCGA CTACCAGCTC
GGCCTGCGCC GGGCCCGGGC GTCGACCCTG TTGATGCTGG CCCTGCCCGG TTCGGCCTAT
ATCTACCAGG GCGAGGAGCT CGGGCTGCCC GACCACACCG AGCTCGACGA CGACCTGCGT
CAGGACCCGA CCTGGTGGCG CTCCGGTTAC ACCGAGGCCG GCCGCGACGG CTGCCGCGTG
CCGCTGCCCT GGGAGGCCGG CGAGCCCGGC CTGGGCTTCG GACCGGGCGG CGCGACCTGG
CTGCCGCAGC CCGCGTCCTA CGCGGAGCTG GCTCGTGACA AGCAGGAGGA CGTCGAGGGT
TCGACCCTGG AGATGTACCG CACCGCGCTG GCGTTCCGGC GCGCGTTCGC GCTCGCCGTC
GGCGACCTGG AGTGGGTCGA CGCCGAGCCG GGCGTCGTGC GGTTCGTCAA CGGTGAACTG
ACCATCGCCG CCAACACCGG GACCGAAGCG GTTCCGATGC CGGTCGGCGA GCTGCTGATG
GCCTCCGGCG AGCTGACCGA TCACTCGGTG CTCCCGCCCG ACACCACGGT CTGGTTGCTC
ACCGAGGTCT TCAGCGAGTA G
 
Protein sequence
MTIEETVAFS DITADLESTP GQDWRRTAVV YQIYPRSFAD SNGDGIGDLP GINHRLPALA 
ELGVDAIWLS PFYKSPQADA GYDVADYRDV DPVFGTLADF DAMLERAHGL GLKVIVDLVP
NHSSDEHVWF QQALAAAPGS PERDRYVFRE GRGEHGELPP NNWESCFRGP AWTRTTNPDG
TPGQWYLHLF DTKQPDWNWE NPQVRAEFLD VLRFWLDRGV DGFRVDVAHA LIKAPGLPDM
KASDEVVEDS EGYFHTGPMW DQDRVHEVYR EWRALLDTYS PDRILCAEAW VPSLSRLARY
VREDEMHQAF NFDYLESEWD ATHLRSVIDS SVAANDEVGA PTTWVLSNHD VVRHVSRLGL
PAGPRPNGIR AQDQQPDYQL GLRRARASTL LMLALPGSAY IYQGEELGLP DHTELDDDLR
QDPTWWRSGY TEAGRDGCRV PLPWEAGEPG LGFGPGGATW LPQPASYAEL ARDKQEDVEG
STLEMYRTAL AFRRAFALAV GDLEWVDAEP GVVRFVNGEL TIAANTGTEA VPMPVGELLM
ASGELTDHSV LPPDTTVWLL TEVFSE