Gene Nmar_1034 details


Gene Information

Locus tag: Nmar_1034
Symbol:
ID: 5773459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 907775
End bp: 909073
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 41%
IMG OID: 641316676
Product: elongation factor 1-alpha
Protein accession: YP_001582368
Protein GI: 161528542
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG5256] Translation elongation factor EF-1alpha (GTPase)
TIGRFAM ID: [TIGR00483] translation elongation factor EF-1 alpha
            [TIGR00485] translation elongation factor TU
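
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The record does not state either convention, but both are consistent with the listed values. The function name `cds_lengths` is illustrative and not part of any database API.

```python
# Values are copied from the record; the helper name is hypothetical.
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(907775, 909073)
print(gene_len, protein_len)  # 1299 432
```

Both results agree with the Gene Length and Protein Length fields.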


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATA AACCACACTT GAACCTGATT GTTACAGGTC ATATTGATAA TGGAAAATCA 
ACAACTATGG GTCATTTCTT GATGGATCTT GGCGTTGTAG ATGAAAGAAC AATTGCATCC
CATGCATCCG AATCCGAGAA GACCGGAAAA GGTGATACTT TCAAGTATGC TTGGGTAATG
GATAACATTA AGGATGAAAG AGAGAGAGGT ATTACAATCG ATCTTGCATT CCAAAAATTC
GAGTCCCCAA AGTACTTCTT TACTTTGATT GACGCTCCTG GTCACAGGGA CTTTATTAAA
AACATGATTA CTGGTGCTTC TGAAGCCGAC GCAGCAGTCT TAGTACTTTC AGCTAAAGAA
GGTGAAACCG ATACTGCAAT TGCTGCAGGT GGTCAAGCAA GAGAACACGC ATTCTTGCTT
AAGACTTTAG GTGTAAACCA ACTAATCGTT GCAATCAACA AAATGGATGA TAGCAACTAT
TCTGAAGAAG CATTCAAAGT AGCCAAAGAA AAAGGTGAAA AATTAGTAAA ATCTGTAGGT
TACAAACTAG AAAATGTACC ATTCATTCCA GTTTCTGGAT GGAAAGGTGA CAACTTGGTT
AAAAAATCCG AAAACATGTC ATGGTACTCT GGTAAAACAC TACTTGAAGC ATTTGATGAC
TTTACAGTAT CTGAAAAACC AATCGGTAAA CCACTACGTG TTCCAATCCA AGATGTTTAT
ACCATCACTG GTGTAGGTAC CGTTCCAGTA GGTAGAGTTG AGACCGGAGT TATGAAAGCA
GGAGACAAAA TCGTCGTAAT GCCTTCAGGT GCTCCTGGTG AAATCAAATC TATTGAAACT
CACCACACAG AGATGCCATC TGCAGAAGCA GGTGATAACA TTGGTTTCAA CCTTAGAGGT
GTTGAGAAGA AAGACATCAA GAGAGGAGAT GTTCTCGGAA GTCCTGACAA CCCACCAAAT
GTTGCAAAAG AATTCAAAGC ACAAATTATT GTAATTCACC ACCCAACAGC AATCGCACCT
GGTTACACAC CAGTTATGCA CGCACACACT GCACAAGTTG CAGCAACAGT TACTGAGTTC
TTACAAAAGA TCAACCCAGC ATCTGGTGCA GTTGAGGAAG AAAATCCAAA ATTCCTCAAA
GTTGGTGACT CTGCAATTGT AAAAATCAGA CCGGTGAGAC CAACATGTAT CGAAACTTTC
CAAGAATTCC CTGAGATGGG TAGATTCGCC CTTAGAGATA TGGGTGCTAC TATCGCAGCA
GGAATCGTTA AGGAAATTAC CGAAGAGTAC AAACCATAG
 
Protein sequence
MADKPHLNLI VTGHIDNGKS TTMGHFLMDL GVVDERTIAS HASESEKTGK GDTFKYAWVM 
DNIKDERERG ITIDLAFQKF ESPKYFFTLI DAPGHRDFIK NMITGASEAD AAVLVLSAKE
GETDTAIAAG GQAREHAFLL KTLGVNQLIV AINKMDDSNY SEEAFKVAKE KGEKLVKSVG
YKLENVPFIP VSGWKGDNLV KKSENMSWYS GKTLLEAFDD FTVSEKPIGK PLRVPIQDVY
TITGVGTVPV GRVETGVMKA GDKIVVMPSG APGEIKSIET HHTEMPSAEA GDNIGFNLRG
VEKKDIKRGD VLGSPDNPPN VAKEFKAQII VIHHPTAIAP GYTPVMHAHT AQVAATVTEF
LQKINPASGA VEEENPKFLK VGDSAIVKIR PVRPTCIETF QEFPEMGRFA LRDMGATIAA
GIVKEITEEY KP
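
The gene and protein sequences are printed in space-separated blocks of 10 residues. As a minimal sketch of how such a listing could be cleaned and checked against the protein, the code below strips the block formatting, computes GC percentage, and translates in frame 1. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). The helper names `clean_sequence`, `gc_percent` and `translate` are illustrative assumptions, not functions from any particular library, and only the first line of the gene sequence is used as input here.

```python
# Standard amino-acid assignments in NCBI "TCAG" codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-residue blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = clean_sequence(
    "ATGGCAGATA AACCACACTT GAACCTGATT GTTACAGGTC ATATTGATAA TGGAAAATCA"
)
print(translate(first_line))  # MADKPHLNLIVTGHIDNGKS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a way to compare against the GC content field.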