Gene Nmar_0100 details


Gene Information

Locus tag: Nmar_0100
Symbol:
ID: 5773637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 90221
End bp: 91510
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 33%
IMG OID: 641315720
Product: glutamyl-tRNA(Gln) amidotransferase subunit D
Protein accession: YP_001581438
Protein GI: 161527612
COG category: [E] Amino acid transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0252] L-asparaginase/archaeal Glu-tRNAGln amidotransferase subunit D
TIGRFAM ID: [TIGR00519] L-asparaginases, type I; [TIGR02153] glutamyl-tRNA(Gln) amidotransferase, subunit D
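
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 90221
end_bp = 91510

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1290
print(protein_length)  # 429
```

Under these assumptions the result agrees with the listed gene length (1290 bp) and protein length (429 aa).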


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00378073
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAGAAT ATAGAGGATA TGAAGGCAAT TCGTTAGAAT TTCTAAAGAG CAATCAAGTA 
GTTGTTGGAG ATTCAGTCAA AATCCTCTCA GATATTACAT ATTCAGGCAT AATTATGCCT
CGATATGAGC ATAGTGACGA CAAACACATT GTTCTGAAGT TAAAGAGTGG GTATAATGTG
GGATTAGAAA TTGCAAAAAT TGAGAGTATT GAAAAAATTC AATCATCAGA AAAAAATATT
GATGAACTTG AAAAAATCAG CAAAATAGAC GGATTACCAA AAGTATTGTT ACTTTCAACA
GGAGGTACGA TTGCAAGTAA AGTAGACTAT AGAACAGGAG CAGTTACTCC AGTGCTAACT
GCAGAAGAAT TGAATTCATC AGTTCCAGAG CTTTCTAAAA TTGCAAATAT TGATGCAGAA
GTTTTGTTAT CAGAATATTC TGAAAACATT ATGCCAGAAA ATTGGTTAGA GATAGCTAAT
AAAATTAGTA GTTATTCAAA TTCAGACTAT TCAGGAATTA TCATTGCTCA TGGAACAGAC
ACAATGCATT ATACATCATC ATTTCTTTCA TTTGCACTCG CAGGATTTCC AGTTCCAATT
GTTTTAGTTG GTTCACAAAG ATCATCGGAT AGAGCATCAT CAGATGCAGC ATTAAATCTA
ATTGGTGCCA CTAAATTCAT TACCGAAAGC AAAACAAAAG GAGTGTACAT TGTTATGCAC
AATGATGAAA ATGACAATAC CGTTGCATGT CACATTGGAA CAAGAGTTAG GAAAAATCAT
ACAAGTAAAC GAGGAGCATT TCAAACAGTA GGAGATGATC CTGCTTTCAT AATTGCAGAA
GAAAAAATTC AAAAAAATAT TTCAAAAGAG TTCTATAAAG TTCAAAAATT CCAACCAAAA
ATTAATCTAG ATACAAAAAT TGCATTAGTA AAATACTATC CAGGATATGA TCCAAAATTA
GTTGAACAAA TTATTGACAA CGGATACAAA GGAATAATCT TTGAAGGTAC AGGATTAGGA
CATATTGGAA GGGTCATGTA TGATTCTGTA AAAAAAGCTA GTGAAAAAGG GATATTTCTA
GGCATGACAT CACAGTGTAT TGATGGAAGG GTAAGAATGA CCGTCTATGA AAGTGGCAGA
GATCTTCTAA ATTTAGGCAT AATTCCTTTA GAGAATATGC TTCCAGAAGT TGCTCTAGTA
AAAGCAATGT GGGCATTAGG AAATACTCAG AATATTGAGG AAGTAAAAGA AATTATGCTT
GATAATATTG CATCTGAAAT GTCAATTTAG
 
Protein sequence
MSEYRGYEGN SLEFLKSNQV VVGDSVKILS DITYSGIIMP RYEHSDDKHI VLKLKSGYNV 
GLEIAKIESI EKIQSSEKNI DELEKISKID GLPKVLLLST GGTIASKVDY RTGAVTPVLT
AEELNSSVPE LSKIANIDAE VLLSEYSENI MPENWLEIAN KISSYSNSDY SGIIIAHGTD
TMHYTSSFLS FALAGFPVPI VLVGSQRSSD RASSDAALNL IGATKFITES KTKGVYIVMH
NDENDNTVAC HIGTRVRKNH TSKRGAFQTV GDDPAFIIAE EKIQKNISKE FYKVQKFQPK
INLDTKIALV KYYPGYDPKL VEQIIDNGYK GIIFEGTGLG HIGRVMYDSV KKASEKGIFL
GMTSQCIDGR VRMTVYESGR DLLNLGIIPL ENMLPEVALV KAMWALGNTQ NIEEVKEIML
DNIASEMSI
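
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the blocks might be joined and how the gene sequence relates to the protein sequence. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical and introduced only for illustration. The codon table is built from the standard NCBI amino acid string, which translation table 11 shares with the standard code (table 11 differs only in the start codons it allows). Only short excerpts of the listing are used here.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G, with the first position varying slowest.
BASES = "TCAG"
# Amino acid assignments shared by the standard code and translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Join the 10-character blocks by dropping all whitespace; uppercase."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence listed above.
first_block = clean_sequence("ATGTCAGAAT ATAGAGGATA TGAAGGCAAT")
last_block = clean_sequence("GATAATATTG CATCTGAAAT GTCAATTTAG")

print(translate(first_block))  # MSEYRGYEGN
print(translate(last_block))   # DNIASEMSI
```

The two excerpts translate to the first ten and last nine residues of the protein sequence, and the final TAG codon is the stop. Passing the full cleaned gene sequence to `gc_content` gives a value to compare against the GC content listed in the record.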