Gene Namu_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4901 
Symbol 
ID8450531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5467382 
End bp5468374 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content75% 
IMG OID645043939 
ProductAminocarboxymuconate-semialdehyde decarboxylase 
Protein accessionYP_003204164 
Protein GI258655008 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCACCG CGGTGGCCGG CGGAGTGGTC GACGTGCACG CGCACTGGTT GCCCCGGGAG 
CTGTTCACGT TGCCGCCCGG CGCACCGTAC GGCGCGCTGA CCGACCGGGC CGGCGAGCTG
CATCTGGGCG AGGTGCCGCT GTCCATCGCG GCGACCGCGC TGAGCGACGT GCCGGCCATC
CGGGACGACA TGCGCCGTGC CCGGGTCGGG GTGCGGGTGC TCTCCGCGCC GCCATTCGCC
TTCCCGGTGG GCGACGCCGG CGCGGGGGCC GACGCGGGTG ACTACGTCGC CTCCTTCAAC
GAGTCGCTGG CCGCCGTGGT CGGCGAATCC GACGGCGCGC TGGCCGGTCT CGGACTGGTC
GGGCTGCACG ACCCGGACCG GGTCCGCGAG GAGCTGGCCA CGTTGGCCGT CACGCCCGGC
ATCGCCGGGG TGGCCATCCC GCCGCTGCTG CGCGGCGACT CGCTGGACCG CGGGGTCCTT
CGCGAGGTGG TGGTCGGCGC CGCCGAGCTC GACCTGGCCG TGCTCGTGCA CCCGATGCAG
CTGCCCCGGC CGGAATGGTC GTCGTACTAC CTGGCCAACC TGATCGGCAA CCCGACCGAG
ACGGCCACCG CGGTGGCCTC GCTGCTGCTG TCCGGCCTGG CCGAGGAGCT CCCGCTGCTG
CGCATCTGCT TCGTGCACGG CGGGGGCAGC GCCCCCGCCC TGCTCGGCCG GTGGGAGCAC
GCCTTCACCC GCCGGGCCGA CGTCGCCCGG TCGGCCAAGC GCGGACCCCG CGAGGGCTTC
CGGGAGCTGT TCCTGGACAC CGTCACGCAT GACCCGGACG CACTGGATCT GCTGGTCGCA
CAGGCCGGCG ATGGCCGGAT CGTGGCCGGC AGCGACTACC CGTTCGACAT GGCCCAACCC
CATCCCGTCG CCTTCGCCGT GGACAACGGC CTGCCCGCCG CCACGCTGGC GGCCAGCGGC
CGGGCGTTCC TCGGCCTGAC CCCGGCCCGG TGA
 
Protein sequence
MITAVAGGVV DVHAHWLPRE LFTLPPGAPY GALTDRAGEL HLGEVPLSIA ATALSDVPAI 
RDDMRRARVG VRVLSAPPFA FPVGDAGAGA DAGDYVASFN ESLAAVVGES DGALAGLGLV
GLHDPDRVRE ELATLAVTPG IAGVAIPPLL RGDSLDRGVL REVVVGAAEL DLAVLVHPMQ
LPRPEWSSYY LANLIGNPTE TATAVASLLL SGLAEELPLL RICFVHGGGS APALLGRWEH
AFTRRADVAR SAKRGPREGF RELFLDTVTH DPDALDLLVA QAGDGRIVAG SDYPFDMAQP
HPVAFAVDNG LPAATLAASG RAFLGLTPAR