Gene Nham_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1235 
Symbol 
ID4032229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1393177 
End bp1394292 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content66% 
IMG OID637969714 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_576523 
Protein GI92116794 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGATT CTCGGCGGGT GACGCTTCAG CCGGGCGACA CCATCGGAAT TCTCGGCGGC 
GGACAACTCG GCCGGATGCT GGCGTTGGCG GCGGCGCGCC TGGGCCTCAA GTGCCAGGTG
TTCTCGCCAG ATCCGGACTC GCCGGCGTTC GACGTCGTGC AGAACGCCAC CTGTGCCGAA
TACGCCGACG TCGAGGCGCT GGAATTGTTC GCAGCCGACG TCGACGTCGT CACCTATGAA
TTCGAGAATG TCCCGGCGGC GACCGCGATG GTGCTCGCCG CGCGCCGGCC GGTGCTACCC
AACCATCGCA TTCTGGAGAC GGTGCAGGAC CGGCTGGTTG AGAAGAACTT CATCACCGGA
CTCGGTATCG GCACCGCCGC CTATACCGAC GTGGCGTCGG CGGAGGCGTT GCGCGAGGCG
ATCGAAATCA TCGGCCTTCC GGCGGTGATC AAGACCCGCC GCTTCGGCTA CGACGGCAAA
GGTCAGGCCA TCATTCGCGA GGGAGACGAT CCCAGCCAGG TGTGGGACGA TCTCGGCACC
AAGGCGGCGA TCCTGGAAGC CTTCGTAGCG TTCGAGCGCG AGATTTCCGT GATCGTCGCG
CGCAGCGCCG ATGGCGGCGT CGAATGCTTC GACGTGACCG AGAACGAGCA TCGCGATCAC
ATCCTGAAAT ATTCGCGCGT ACCAGCGGCG ATCCCCGATA CGCTCGCCGC TCAGGCGCGC
GACATCGCGC AAAAGATCGC GATCGCGCTC GACTATGTCG GCGTACTGGC CGTCGAGATG
TTCGTGGTGC CGGGCGCCGG CGGGCCGACG CTGCTCGTCA ACGAGATCGC GCCGCGGGTC
CACAACTCCG GGCACTGGAC GCTCGACGGC GCCTCGATTT CGCAATTCGA GCAGCACATC
CGCGCGGTCG CCGGCTGGCC GCTCGGCAAG CCGGTCCGGC ACGGCAGCGC AGTCATGACA
AACCTGATCG GCGACGACAT CCTCGACTAC GGGAAGTGGC TCACCGTGCC GGGCGCAGCC
GTGCATATCT ACGGCAAGGG CGCGCCGCGC CCCGGCCGCA AGATGGGCCA TGTCACCGAG
ATCAAGCATG TCACCGAGGT CAAGGGCCGC GGCTGA
 
Protein sequence
MSDSRRVTLQ PGDTIGILGG GQLGRMLALA AARLGLKCQV FSPDPDSPAF DVVQNATCAE 
YADVEALELF AADVDVVTYE FENVPAATAM VLAARRPVLP NHRILETVQD RLVEKNFITG
LGIGTAAYTD VASAEALREA IEIIGLPAVI KTRRFGYDGK GQAIIREGDD PSQVWDDLGT
KAAILEAFVA FEREISVIVA RSADGGVECF DVTENEHRDH ILKYSRVPAA IPDTLAAQAR
DIAQKIAIAL DYVGVLAVEM FVVPGAGGPT LLVNEIAPRV HNSGHWTLDG ASISQFEQHI
RAVAGWPLGK PVRHGSAVMT NLIGDDILDY GKWLTVPGAA VHIYGKGAPR PGRKMGHVTE
IKHVTEVKGR G