Gene Namu_4102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4102 
Symbol 
ID8449725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4529061 
End bp4530212 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID645043148 
ProductAminocarboxymuconate-semialdehyde decarboxylase 
Protein accessionYP_003203380 
Protein GI258654224 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.13287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0312348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGGTG CCGCCGGGCG GTGGCACAGC GACGCCGGCG CGGAGCCGGC CGTCCAACCG 
AACACGTGGG AGGTCGCGTT GACGAGCTCG AATCCGGCGG ACACGTCCGT CACTGCAGGA
TCGACGGCGG CCGGATCGAC GGCGGCCGCA CCGACCGGGC CGCGGGTGGT CGACGTCCAT
GCGCATGCGA TGCCGATGCC GCTGCTGCGC AGCCTGGCCG ACCGCGGCCT GGCCGACGTC
TCCGCGGTCG ATCAGGGCAT CGTCCGGCTC GACCCGAAGG TCAGCGGCGT GGGTCCGGGA
GCTCCGCTGC CGCTGGCCCG CTCCCAGCAC GACGTGGCCA CCCGGCTGGT CGAGATGGAC
GCGGCCGGCG TCGACGTGCA CGCGGTGTCG TTGCCGCCGT TCCTGTTCGC CACCAACGCC
GACGACGCGG GCTTCGCCAC CGGCATCGTG GCCCAGGGCA ACGACGAGCT GGCCGGCTAC
GTCGCCGGCG CCCCCGATCG GTTGGTCGGC CTGGGCTACG TGCCGCTGGG CTGGCCCGGG
GTGGCCGACG AGGCGGTACG CGTGCTCGAC GAGCTGGGCC TGGCCGGCAT CGCGATCGGC
AGCCAGGGCG GCGGCAAGGA TCTGGACGAT CCGGTGAACG AGGATCTGTG GGCGTTGCTG
GCCGAGCGGA ACACCTTCGT GTTCCTGCAC CCGTCGGGCA TGCCGGCCGG TCCGCGACTC
AAGGACTACT GGATGCCGCA GCTGGTCGGG TATCCGATGG AGACGGCGAT CGCGGTGGCC
CGGCTGGTGT TCAGCGGCAC CCTGGAGCGG TACCCGATCA CCCTGTGCCT GGCCCACGGC
GGCGGCTGCG TGCCCTCGCT GCGCGGGCGG ATGGACATGG GCTGGGAGCG CAAGGACGTC
GCCCACACCA ACGACCACCC GCCGACCCAC TACACCGATC GGCTCTACTA CGACACGGCG
GTGTTCAACA CGACCGTGCT GAGCCGGATC GTGCAGGACG TGGGCGTCGA GCACGTGCTG
ATGGGTACCG ACCACCCGTT CGAGCTGGGC GATCCGACGC CGCGAAAGAC CGTGGGCGAC
CTGGGGCTGA GCGAGGCGGA CACCGCGGCC ATCCTGGGCG GCACGGCCAG CCGGTTGCTC
GGGTTGGCCT GA
 
Protein sequence
MIGAAGRWHS DAGAEPAVQP NTWEVALTSS NPADTSVTAG STAAGSTAAA PTGPRVVDVH 
AHAMPMPLLR SLADRGLADV SAVDQGIVRL DPKVSGVGPG APLPLARSQH DVATRLVEMD
AAGVDVHAVS LPPFLFATNA DDAGFATGIV AQGNDELAGY VAGAPDRLVG LGYVPLGWPG
VADEAVRVLD ELGLAGIAIG SQGGGKDLDD PVNEDLWALL AERNTFVFLH PSGMPAGPRL
KDYWMPQLVG YPMETAIAVA RLVFSGTLER YPITLCLAHG GGCVPSLRGR MDMGWERKDV
AHTNDHPPTH YTDRLYYDTA VFNTTVLSRI VQDVGVEHVL MGTDHPFELG DPTPRKTVGD
LGLSEADTAA ILGGTASRLL GLA