Gene Namu_5195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5195 
Symbol 
ID8450826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5790122 
End bp5792245 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content69% 
IMG OID645044227 
ProductAlpha-amylase 
Protein accessionYP_003204451 
Protein GI258655295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAC CTGCACTGCG CCGACTGGGT ACCGTCGCGC TGGCTGCCAC CCTGGCGGTG 
ACCATCAACC AGGTCGTCGC GCCCGCCGCC GCGGCCGAGA CCACCTCGGT CACCGTCGCC
GGCTCGTTCG ATCAGGAGAT CGGCTGTGCC GACGACTGGC AACCGGCCTG CGCCGCCGCG
CACCTGGACA AGCAACCCAA CGGCCAGTTC GCCAAGACGC TGACCATCCC CGACTCCGGG
TCGGGCGCCA CGAGCTACGG GTACAAGTTC GCCACCAACG ACAGCTGGAA CAACCCGAAC
TTCGGCCTGC GGGGCGGGTC GGGCGACATC GCCCTGCCGA TCCCGGCCGG CGGCCAGAAC
GTCACGTTCG TCTTCGACCC GGTGTCCAAC CTCGGCTCGG ACTCGATCAA CAGCGCCGCC
ATCTCCGGCA CCTTCCAGAG GCAGCTCGGC TGCGCGGCCG ACAACACCCT GTCCTGCCCG
CAGAGCTGGC TCTACGACGA CGACCACGAC GGTGTCTTCA CGTTGGTCAC CGGGGCCATC
GCCCCGGGCA GCTACACGGT GTCGGCGGCC ACCCTGGGCA GCAGCTCGGG GTCGGCGCCG
GTGACCTTCA CCGTGCCGGC GGCCGGCGGC GGGGCGACGA CGACGATCAG CTTCGACCCG
AAGGCCGGCG CGCCCACCGT GTCGGTGGTC CCGCTGGTCC AGCCGGCCCC GGCGACCTTC
AGCCATCCGA CGGCCAAGCG CGGCGACGTG GTGGCCAACC TGTTCGAGTG GAACTGGAAG
TCGGTGTCCA CCGAGTGCCA GACCGTTCTC GGCCCGGCCG GTTACGGCGC GGTCCAGGTC
GCGCCGCCGC AGAACTCCAT CAACAACCCG GACGGGGGCG GCCACCCGTG GTGGGAGGTC
TACCAGCCGG TCAGCTATTC GCTGAACAGC CGGATGGGCA GCGAGGACGA CTTCAAGGCC
ATGGTGAAGA CCTGCCGCGC GGCCGGCGTG CAGGTCATCG TGGACACCGT CATCAACCAC
ATGACGGGCC AGGGCTCGAC CTCCTACGAC CCGGCCACCA CCGGATGGAC CCACACCAAC
TACCCGGGGC TGTATTCGGC CGCGGACTTC CACACCTCGC CGGCCGACTG CCCGGAGGCC
AGCAACACCA TTGACGACTT CAACGACTAC CGGCAGGTCA CCCAGTGCCA GCTGGTCGGT
CTGGCCGATC TGCGGACCGA GTCCGACGCC GTGCGCACGC AGATCGCCGG GTACTTGAAC
AAGCTGCTGT CCTACGGGGT CACCGGGTTC CGGGTGGATG CGGCCAAGCA CATCGGCCAG
GCCGATCTCG CCGCGATCGT CAAGAAGCTC AACCGCACCG TCGACGGCAA GCGCCCGTAC
ATCGCCCTGG AGGTGCCGCC GGGCGGGCCG GGCAAGCTGA CCCCGTTCGC CTTCCAGGAC
CAGGGCAACC TGCTCGGGTT CGATTTCGCC ACCCAGGTCA AGGCCGCCTT CACCAGCAAC
CTCACCGACC TGACGGTGTT CGGGGAGGAT GCCGGACTGC TGCCCAGTGA CCACTCGCTG
GTCTTCGTGC AGAACCACGA CACCGAGCGG GACGGCACGA CGCTGAGCTA CAAGAACGGG
CCGACCAACA CCCTGGCCAC CGAGTTCATG CTGGCCTACG GGTACGGCCG GGCCGAGGTC
TACTCCGGCT TCGTCTTCGC CAACAAGGAC GACTCACCCC CGGCCGACGC CAACGGGTTC
GTCACCGACG CCAACTGCGA CAACGGCTGG GCTTGCACCC ACCGCAGCCG GGGGGTGGCC
AACCTGGTCG ACTTCCACAA CTTCGTCGGC GACGCGCCGG TGCGCAACGT CGACGACGAC
GGGGTCAACC TGCTCGCCTT CAGCCGGGGC AACAAGGGCT GGATCGCGCT GAACAACCAC
GACACCCCGC AGACCCGGAC GTTCTCGACC GGGCTGCTGC CCGGCGTCTA CTGCGACGTC
ATCCACGGCA CCTACAGCCA GGCCTCGCGG GGCAAGAGCT GCGACGGACC GACGGTGACC
GTGGGCTCGT CGGGCAAGGC GACCGTGACC GTTCCGGCCA AGGACGCGGT CGCGTTCAGC
GCCGGGAACC GGGTCGGCCG CTGA
 
Protein sequence
MTRPALRRLG TVALAATLAV TINQVVAPAA AAETTSVTVA GSFDQEIGCA DDWQPACAAA 
HLDKQPNGQF AKTLTIPDSG SGATSYGYKF ATNDSWNNPN FGLRGGSGDI ALPIPAGGQN
VTFVFDPVSN LGSDSINSAA ISGTFQRQLG CAADNTLSCP QSWLYDDDHD GVFTLVTGAI
APGSYTVSAA TLGSSSGSAP VTFTVPAAGG GATTTISFDP KAGAPTVSVV PLVQPAPATF
SHPTAKRGDV VANLFEWNWK SVSTECQTVL GPAGYGAVQV APPQNSINNP DGGGHPWWEV
YQPVSYSLNS RMGSEDDFKA MVKTCRAAGV QVIVDTVINH MTGQGSTSYD PATTGWTHTN
YPGLYSAADF HTSPADCPEA SNTIDDFNDY RQVTQCQLVG LADLRTESDA VRTQIAGYLN
KLLSYGVTGF RVDAAKHIGQ ADLAAIVKKL NRTVDGKRPY IALEVPPGGP GKLTPFAFQD
QGNLLGFDFA TQVKAAFTSN LTDLTVFGED AGLLPSDHSL VFVQNHDTER DGTTLSYKNG
PTNTLATEFM LAYGYGRAEV YSGFVFANKD DSPPADANGF VTDANCDNGW ACTHRSRGVA
NLVDFHNFVG DAPVRNVDDD GVNLLAFSRG NKGWIALNNH DTPQTRTFST GLLPGVYCDV
IHGTYSQASR GKSCDGPTVT VGSSGKATVT VPAKDAVAFS AGNRVGR