Gene Nmar_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0140 
Symbol 
ID5774448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp129568 
End bp130626 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content29% 
IMG OID641315760 
Productpyruvate carboxyltransferase 
Protein accessionYP_001581478 
Protein GI161527652 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000000672144 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCCAAAA AAGTTCAAAT CCTTGACACT ACATTACGAG ATGGCAGTTA TTCTGTGAAT 
TTTTCATTTA CTAGTTCTGA TACATCTATA ATCTGTTCTA AATTGGAAAA ATCTGGAATT
AAATTAATTG AAGTTGGTCA TGGACTTGGG TTTAATGCTT CTAACTCTGG TTATGGAAAA
TCTACACAAT CTGATGAAGA ATATATGATT GCTGCAAAAG AGTCTTTATC AAAATCAATG
TATGGAATGT TTTGTATTCC TGGAATTGCT AAACTATCTG ATTTAGAACT TGCTAAAAAA
CATGGTATGG GATTTATTCG AGTAGGTACC GATGTTACCA AAGTACATCA ATCTGAAAAA
TTTATCAAAA AAGCAAAAAA TCTTGGATTT TTTGTGGCCT CAAATTTTAT GAAATCTTAT
GTAATGCCAC CTGATAAATT TGCATCAATT GTTAAACAAT CTGAAGAATT TGGAACTGAT
ATGGTGTATA TTGTGGATTC TGCTGGAGGT ATGTTTTCTT CAGATTTGTT AGAATATTAT
AATTCAATAA GAAACGTATC TGAAATTCCA CTTGGTTTTC ATGGTCATGA TAATTTAGGT
ATGGCAATTT CAAACAGTCT GTATGCTGCT GATTTAGGTA TGGAATACAT AGATTCTTCT
CTTCAGGGAA TTGGAAGAAG TTCTGGAAAT GCTTGTACTG AAGTTTTAGT TATGGCATTG
AAGAAAAAAG GATTCAAGAT AGATGTTGAT TTTCATAGTC TCTTTGAAGC AGGACAAGAA
TGTGTTTACC CATTAATCAA TAATTCTAAT AAATTACCCC TTGATATTGT TTCAGGTTAT
GCTGATTTTC ATTCAAGTTA TATGCATCAT ATAATGAAAT ATTCTTCCAA GTTTAAAGTT
GATCCATTAT TATTGATTAT AGAATATTCT AAAATTAATA AAATTGATAT TGATGAAAAA
AAATTAGAAC AAATTGCTAA AAAATTAAAG AGAAAACAGG ATATTTACAC TGCAAAATAC
AGATTTAACA GATATGTCGG AAGAGAACAA GATAATTAA
 
Protein sequence
MSKKVQILDT TLRDGSYSVN FSFTSSDTSI ICSKLEKSGI KLIEVGHGLG FNASNSGYGK 
STQSDEEYMI AAKESLSKSM YGMFCIPGIA KLSDLELAKK HGMGFIRVGT DVTKVHQSEK
FIKKAKNLGF FVASNFMKSY VMPPDKFASI VKQSEEFGTD MVYIVDSAGG MFSSDLLEYY
NSIRNVSEIP LGFHGHDNLG MAISNSLYAA DLGMEYIDSS LQGIGRSSGN ACTEVLVMAL
KKKGFKIDVD FHSLFEAGQE CVYPLINNSN KLPLDIVSGY ADFHSSYMHH IMKYSSKFKV
DPLLLIIEYS KINKIDIDEK KLEQIAKKLK RKQDIYTAKY RFNRYVGREQ DN