Gene Namu_4218 details

Gene Information

Locus tag: Namu_4218
Symbol:
ID: 8449844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4662984
End bp: 4664246
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 73%
IMG OID: 645043267
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_003203496
Protein GI: 258654340
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
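
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. That reading is an assumption about this record's convention, not something the page states. A minimal sketch of the arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4662984, 4664246)
print(length)                  # 1263
print(protein_length(length))  # 420
```

Both results match the Gene Length and Protein Length fields, which suggests the final codon of the 1263 bp is the stop codon.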


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0535465
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.208178
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGAAC GCACCGGCAT GCCCCGGGTC GGGATGGTCG GTGGCGGCCA GCTGGCCCGG 
ATGACCCATC AGGCGGCCAT TCCGCTCGGC CAGACCCTGC GGGTGCTCTC GATCTCGGCC
GAGGAGAGCG CGGCCCTGGT CACGCCGGAC GTGATGATCG GCCATCACAC CGACCTGGAT
GCCCTGCGCC GGTTCGCGCA GGGGTGCGAC GTCGTCACCT TCGATCACGA GCATGTGCCC
GGTGAACACA TCCGCACCCT GGTCGCCGAA GGCTTTGCCG TGCACCCGGG CGCGGACGCA
CTGCAATTCG CGCAGGACAA GGCGCTGATG CGCACCCGGT TGGCCGAACT CGGGGTGCCG
GTCCCGGCCT TCGCGGTGAT CGCGGCCGAC GACCCGGCCC GTGACGACCG GATCGTGGCG
TTCGGCGACG CGCACGGTTG GCCCTGCGTG GTCAAGACCG CGCGCGGCGG GTACGACGGC
CGCGGGGTGT GGGTGGTGCG GTCGGCGACC GAGGCGCCCG AGCTGGACCT GCCGGACGGG
GGCCAGCTGG TGCTGGAGGC CTTCGTGCCG ATGCGCCGGG AGCTGGCCGC CGTGGTCGCC
CGGTCGCCCT TCGGCCAGGC CGCGGCCTGG CCGGTGGTGC AGACCGTCCA GCAGGACGGG
ATCTGCGTCG AGGTGATCGC CCCCGCACCC GGACTGGACG GCGACGTCGC GTCGGCGGCC
GGGCGGCTGG CCCTGCAGGT CGCCGGGGAG CTCGGCGTCG TCGGCATCCT GGCCGTCGAG
CTGTTCGAGG TCGACCCCGG ACCGGACGCG CCCGACGGGA TCCTGGTCAA CGAGTTGGCC
ATGCGCCCGC ACAACTCCGG CCACTGGTCA ATGGACGGGG CGGTGACCGG CCAGTTCGAG
CAGCACCTGC GGGCCGTCCT GGACTACCCG CTGGGCCGCA CCGACCTGCT CGCCCCGTTC
ACCGTGATGG GCAACGTGCT CGGTGGCCCG GCCGACGGCC CCGGTGCGGG CATCGGCATG
GACGAGCGTG TCCATCACCT GGCTGCCCGG TTCCCGCAGG TCAAGGTGCA TCTATACGGC
AAGGCTTTCC GGCCCGGGCG CAAGCTCGGG CATGTCAATG TGCTCGGCTC GGATCTGGGT
GAGCTGCGGC GCGTCGCCGC GCTGGCCGCG ACCTGGCTCA GCGAAGGCGT GTGGGCCGAC
GGTTGGAACG CCCATGCCGC CGATCCCCGC GCAGCACGAC CGCAGGAGGT GGCGGCGCGA
TGA
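
The GC content field in Gene Information is presumably the share of G and C bases in the gene sequence above, rounded to a whole percent. The page does not say how it was computed, so the following is only a sketch under that assumption; `gc_content` is a hypothetical helper, and it ignores the spaces that split the sequence into blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence only, not the whole gene.
print(gc_content("GTGGACGAAC GCACCGGCAT"))  # 65.0
```

Passing the full 1263 bp sequence instead of the two-block fragment would give the value to compare with the reported figure.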
 
Protein sequence
MDERTGMPRV GMVGGGQLAR MTHQAAIPLG QTLRVLSISA EESAALVTPD VMIGHHTDLD 
ALRRFAQGCD VVTFDHEHVP GEHIRTLVAE GFAVHPGADA LQFAQDKALM RTRLAELGVP
VPAFAVIAAD DPARDDRIVA FGDAHGWPCV VKTARGGYDG RGVWVVRSAT EAPELDLPDG
GQLVLEAFVP MRRELAAVVA RSPFGQAAAW PVVQTVQQDG ICVEVIAPAP GLDGDVASAA
GRLALQVAGE LGVVGILAVE LFEVDPGPDA PDGILVNELA MRPHNSGHWS MDGAVTGQFE
QHLRAVLDYP LGRTDLLAPF TVMGNVLGGP ADGPGAGIGM DERVHHLAAR FPQVKVHLYG
KAFRPGRKLG HVNVLGSDLG ELRRVAALAA TWLSEGVWAD GWNAHAADPR AARPQEVAAR
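
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial code named in Gene Information), GTG is an accepted start codon and is read as methionine in the initiator position. The sketch below illustrates this on the first 30 bases. `translate_cds` is a hypothetical helper, the codon table is built from the standard NCBI base order, and `ALT_STARTS` lists only three of the start codons that table 11 allows.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Subset of table 11 start codons (assumption: enough for this illustration).
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an accepted start codon as M; stop at '*'."""
    bases = "".join(sequence.split()).upper()
    usable = len(bases) - len(bases) % 3  # drop any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = bases[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in ALT_STARTS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(translate_cds("GTGGACGAAC GCACCGGCAT GCCCCGGGTC"))  # MDERTGMPRV
```

The output matches the first ten residues of the protein sequence above. Without the start-codon rule the first residue would come out as V.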