Gene Namu_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4031 
Symbol 
ID8449650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4445319 
End bp4446560 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content70% 
IMG OID645043076 
Productalpha amylase catalytic region 
Protein accessionYP_003203312 
Protein GI258654156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0102813 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGTGA CGACGGATCG ATCCTTCGAA CAGGTCATCT GGTGGCAGGT ATATCCGCTG 
GGTTTTGGCG GCGCCCCGAT TCGGCAGGCC CACACCCCCG GTCACCGGCT GCGCCGGCTG
CTCGGCTGGT TGGACGAGGT GGTGGAGCTG GGCTGCACCG GGCTGCTGCT CGGGCCGATC
TGCGCGTCGG CGACGCACGG GTACGACAGC GTCGATCTGC GGCGGATCGA CCCGCGCCTG
GGCACCGAGC ACGACTTCGA CGACCTGGTC GCCGGCTGCC GATCCCGGGG CCTGCGCCTG
CTGCTGGACG GGGTGTTCAG CCACGTCGGC CGCGATCATC CGCTGGTCGC GCAGGCGCTC
GCCGAAGGCC CCGACAGTGC CGCGGCCGCC CTGTTCGACA TCGACTGGTC CGACCCGGCC
GACCCGCACC CACGGGTGTT CGAGGGCCAC GACACCCTGG TCCGGCTGAA CCACTCCGGT
GACGCCGCCG CCGACTGGGT CACCGACGTG CTGATCCACT GGTTGGACCG CGGCGCCGAC
GGCTGGCGGC TGGATGCGGC CTATTCGGTG CCGCCCCCGT TCTGGGCCCG CGTGCTGGCC
GCCACTCGCG AGAAGCACCC CGACGCCTGG TTTCTCGGCG AGGTCATCCA CGGCGACTAC
CCGGATTTCG TGACCCGCTC CACCGTCGAC TCGGTGACCC AGTACGAGCT GTGGAAGGCG
ATCTGGTCCT CGCTCAAGGA CGGCAACTTC TTCGAGCTGG ACTGGACGCT GCAGCGGCAC
AACGACTTCC TGGATCACTT CCGGCCCAAC ACCTTCATCG GCAACCACGA CGTCACCCGG
ATCGCCTCTC AGGTCGGCCC GACGCTGGTG CCGGTCGCGC TGACGATCCT GCTCACCGTC
GGCGGCATCC CGTCGATCTA CTACGGCGAC GAGCGCGGTT TCACCGGGGT CAAGCAGGAC
CGGCTGGGTG GCGACGACGC GGTCCGGCCC GAGTACCCCG ACTCGCCCGC CGATCTGCCC
CGCAACGATC TGTGGCGGAT GCACGCCGGG CTCATCGACG TGCGCCGGAG CCGGCCCTGG
TTGGCCGGCG CGTCCACCGA ATCGCTGGAG CTGACCAACA CCCGCTACCG GTACCGCGCA
TCCGGCGACG GCGAGCACCT GGACGTCGAG CTGGACCTGG ACCGGCCGTC GGTGCTGATC
CGGGACGCCG GCGGCGGGAC GATCTGGGAG CACGCCGGCT GA
 
Protein sequence
MDVTTDRSFE QVIWWQVYPL GFGGAPIRQA HTPGHRLRRL LGWLDEVVEL GCTGLLLGPI 
CASATHGYDS VDLRRIDPRL GTEHDFDDLV AGCRSRGLRL LLDGVFSHVG RDHPLVAQAL
AEGPDSAAAA LFDIDWSDPA DPHPRVFEGH DTLVRLNHSG DAAADWVTDV LIHWLDRGAD
GWRLDAAYSV PPPFWARVLA ATREKHPDAW FLGEVIHGDY PDFVTRSTVD SVTQYELWKA
IWSSLKDGNF FELDWTLQRH NDFLDHFRPN TFIGNHDVTR IASQVGPTLV PVALTILLTV
GGIPSIYYGD ERGFTGVKQD RLGGDDAVRP EYPDSPADLP RNDLWRMHAG LIDVRRSRPW
LAGASTESLE LTNTRYRYRA SGDGEHLDVE LDLDRPSVLI RDAGGGTIWE HAG