Gene Namu_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3221 
Symbol 
ID8448835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3550059 
End bp3551300 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content70% 
IMG OID645042300 
Productamidohydrolase 
Protein accessionYP_003202541 
Protein GI258653385 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000027233 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000156349 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCCTC GGATGATCGC ACTTCGTTCG TCCTCGCTGT TCGACGGCTC GGTGTTCTTC 
GACGACGGAG TGACCGTCGT CGTCGACGGC GAGTCGATCG CCGGCGTGCT CCGCGGGCAT
CCCGACCTCG GGTCGGACGT CGAGGTGATC GAGTTGGGCG AGGCCACCGT GCTGCCCGGT
CTCATCGACA CCCATGTCCA TCTGGTCGCC GGCAGCGGGG TGCGCGCCCT GGATCTGGTG
GAGGGCTACT CGGACCAGGA GATCGAGGCC GTGGTCACCC GGTCCCTGGC CGCGCACCTG
GCGGCCGGGG TGACGACGGT CCGCGACCTG GGGGACCGGC GGTTCGTGGT GGTCAATCGC
CGGGACGACC AGCATGCCCG CCCGCTGACC ACTGCCCGGC CGTGGACACC GACCATCCTG
GCCGCCGGAC CACCGCTGAC CACGCCCCGG GGCCATTGTC ACTACCTGGG CGGCGAGGTG
TCCGGTCCGG TGGAGATCGA GGCCGCGGTG CAGGAGCGGA TCGACCGCGA GGTGGACGTG
GTCAAGGTGA TGGCCAGTGG CGGGATGGCC ACTACCGGCA CCGACGTGAT GATGCCGCAG
TTCTCCCTGG CCGAGATGCG GCTGATCGTC GACCTGGCGC ATGCCGCCGG GATCGCCGTG
ACCGCGCACG CGCATGCCCT GCCCGCGGTG GAGATCGCGC TCGCCGCCGG GGTCGACGGG
CTCGAGCACT GCAGTTGCCT GACCCCGCAG GGGCCACGGG TGTCCGACGA ACTGCTCGCG
GTGCTGGCCG AACGTCAGGT GCCGATCGGG GCGGCTCTGA TGGCCCCACC ACCGGAAGCG
TTCGAGCACG CTCCGCCCAA TATCAAGAAG GTGATGGCCC AGATGGGCAT GACCCCGGAG
ACGATGCTGG AAAGCCGGCG GTCGATGGTG GGCCGGATGC ACGCGGCCGG GGTCCGGTTC
GTCGGCGGTT CGGACGCCGG GATCGAGCCG TTCATGGCCC ACGGCCTGAT GCGCTCGGGT
CTGGGCTTTC TGCTCTCCGC CGGGGCGTCG GTCAGTCAGA CCCTGGCCGC CGGCACCTCG
CTGGCCGCTG CCGCTTGCGG GCTGACCCGA AAAGGGTTCC TGCGTCAGGG TTTCGACGCC
GATCTGGTCG TCGTCGACGG ACGCTTCGAT TCCGACCTGG CGCCGCTGGC GCAGGTGCGT
CAGGTCATGC TCGGTGGTCG GTTCGCCCCG GCCGGGCTAT GA
 
Protein sequence
MTPRMIALRS SSLFDGSVFF DDGVTVVVDG ESIAGVLRGH PDLGSDVEVI ELGEATVLPG 
LIDTHVHLVA GSGVRALDLV EGYSDQEIEA VVTRSLAAHL AAGVTTVRDL GDRRFVVVNR
RDDQHARPLT TARPWTPTIL AAGPPLTTPR GHCHYLGGEV SGPVEIEAAV QERIDREVDV
VKVMASGGMA TTGTDVMMPQ FSLAEMRLIV DLAHAAGIAV TAHAHALPAV EIALAAGVDG
LEHCSCLTPQ GPRVSDELLA VLAERQVPIG AALMAPPPEA FEHAPPNIKK VMAQMGMTPE
TMLESRRSMV GRMHAAGVRF VGGSDAGIEP FMAHGLMRSG LGFLLSAGAS VSQTLAAGTS
LAAAACGLTR KGFLRQGFDA DLVVVDGRFD SDLAPLAQVR QVMLGGRFAP AGL