Gene Namu_1272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1272 
Symbol 
ID8446868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1396376 
End bp1397947 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content73% 
IMG OID645040406 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_003200665 
Protein GI258651509 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0665315 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.630198 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCC AGGACGGGCG TCGGCCCATC CGGCGGGCCC TGCTGTCGGT GTCGGACAAG 
TCCGGCCTGC TGGAGTTGGC CGCCGCGCTG CACGCCGCCG GGGTGAGCAT CGTGTCCACC
GGCGGGTCGG CCCGGGCGAT CGCCGACGCC GGCATCCCGG TCACCCCGGT CGAGCAGGTG
ACCGGCTTCC CGGAGTGCCT GGACGGCCGG GTCAAGACCC TGCACCCGGC GATCCACGGC
GGCCTGCTGG CCGACACCCG GCTGCCCGAC CACCTGCGCC AGGCCGACGA GCTGGGCATC
GAGCTGTTCG ACCTGGTGGT GGTCAACCTC TACCCGTTCC GGCAGACCGT GCGCTCGGGC
GCCTCGTTCG ACGAGTGCGT CGAGCAGATC GACATCGGTG GCCCGGCCAT GGTCCGGGCC
TCGGCCAAGA ACCACCCGTC GGTGGCGGTG GTGGTCGACC CGGCGCGCTA CCCCGACATC
GAGCAGGCGC TGGCCGACGG CGGGTTCACC CTGGCCCAGC GGGCGGCGCT GGCCGCGGCC
GCGTTCGCGC ACACCGCCGC CTACGACATC GCCGTCGCCT CCTACCTGGG CGGGGCGACG
ACCGGAGCCG ACGGCTGGCC CGAGTTCACC GGGGCGAGCT GGGACAAGAT GTCCACGCTG
CGCTACGGGG AGAACCCGCA CCAGGCGGCG GCGCTGTACC GGCACTGGCG GGTCGGGCTG
GCCTCGGCCG AGCAGCTCCA CGGCAAGGAA ATGAGCTACA ACAACTACGT CGACGTGGAC
GCCGCCTGGC GGGCCGTCGG CGACTTCGCC GATCCGGCCG TGGCCGTGGT CAAGCACGCC
AATCCCTGTG GCATCGCCGC GGTGACCGGT GGCGCCGACG ACACGATTGC CCGGGCCCAC
CGGCTCGCCC ACGCGTGCGA CCCGGTGTCG GCCTTCGGCG GGGTGATCGC CGCGAACCGC
CCGGTGACCA TGGAGATGGC CGAGCAGATC GCCGACGTGT TCACCGAAGT GGTGCTCGCC
CCGGCTTTCT CGGCCGACGC GGTGACGGTG CTGACCCGCA AGAAGAACAT CCGGCTGCTG
GTCATGCCGG AGGGCGCCGC GCCCGATCCG ATCGAGTTCC GGTCGATTTC CGGCGGGGTG
CTCGCGCAGC GCCGGGACCA GCTGGACGCC CCCGGCGACG ATCCGGCGAC CTGGACCCTG
GCCGCCGGCC CGGCGGTGGA CGAGGCGACG CTGGCCGACC TGGTCTTCGC CTGGCGGGCC
TGCCGATCGG TGAAGTCCAA CGCCATCCTG CTGGCCGCCG ACGGCGCGTC GGTGGGCATC
GGCATGGGGC AGGTCAACCG GGTCGATTCG GCTCGGCTGG CCGTGGAACG GGCGGGGGAG
CGGGCCCGTG GTTCGGTCGC CGCGTCCGAC GCGTTCTTCC CGTTCGCCGA CGGCCCGGAG
ATCCTGATCG CCGCCGGGGT GCGGGCGATC GTGCAGCCCG GCGGTTCGGT CCGCGACCCC
GAGGTCATCG CGGCGGCCGA GCAGGCCGGG GTCAGCATGT ACTTCACCGG GACCCGCCAC
TTCTTCCACT GA
 
Protein sequence
MSTQDGRRPI RRALLSVSDK SGLLELAAAL HAAGVSIVST GGSARAIADA GIPVTPVEQV 
TGFPECLDGR VKTLHPAIHG GLLADTRLPD HLRQADELGI ELFDLVVVNL YPFRQTVRSG
ASFDECVEQI DIGGPAMVRA SAKNHPSVAV VVDPARYPDI EQALADGGFT LAQRAALAAA
AFAHTAAYDI AVASYLGGAT TGADGWPEFT GASWDKMSTL RYGENPHQAA ALYRHWRVGL
ASAEQLHGKE MSYNNYVDVD AAWRAVGDFA DPAVAVVKHA NPCGIAAVTG GADDTIARAH
RLAHACDPVS AFGGVIAANR PVTMEMAEQI ADVFTEVVLA PAFSADAVTV LTRKKNIRLL
VMPEGAAPDP IEFRSISGGV LAQRRDQLDA PGDDPATWTL AAGPAVDEAT LADLVFAWRA
CRSVKSNAIL LAADGASVGI GMGQVNRVDS ARLAVERAGE RARGSVAASD AFFPFADGPE
ILIAAGVRAI VQPGGSVRDP EVIAAAEQAG VSMYFTGTRH FFH