Gene Nmar_0701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0701 
Symbol 
ID5773491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp640471 
End bp642387 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content37% 
IMG OID641316337 
Productacetate--CoA ligase 
Protein accessionYP_001582035 
Protein GI161528209 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA CCTACAATAT TGGGCTTGGA AATAATGACA CTGACACTAG AATAAAGGCT 
GATTCAGATT TTGTTTCATT TTGGAATGAT CAGGCAAAAA ACCTAACTTG GTTTTCTCCT
TGGAATGAGA CTTTGGATTG GCAGCCACCC TTTGCAAAAT GGTTTGTTGG AGGCACGATT
AATGCTTCTC ATAATGCATT AGATGTACAC CAGGACTCAA AATCTGAAAA ACCTGCTATC
CTATGGGAGG GTGAGAATGG TGACTCTAGA ATTCTCACAT ATGGCCAGAT ACTCACTGAG
GTCCAAAAAT TCTCAAATAT TCTCAAATCT CTTGGTGTGG AAAAAGGTGA TCGTGTTACT
TTGTATCTTC CAATGATTCC TGAATTGCCA ATTGCAATGC TTGCGTGTGC TAGAATTGGC
GCAACTCATA CTGTTATCTT TTCAGGATTT AGTGCAACAT CAATTAGAGA TAGAGTTGAT
GATTCAAAAT CCAAAGTTAT AGTTACTGCT GATGGTGGTT ATCGTCGTGG AAAAATTGTA
AAACTAAAAG AAGTAGTTGA TGAGGCAATT GAAGACTTTG ATTTTGTAAA AAATGTTGTT
GTTGTAGAGA GAACCAAAAA TGAAATTCCA ATGACTTGTA AAGATAAACT TTGGAATGAT
TTAATGAATG ATGCATCTGA TAATTGCCCT GCAGAAAAAT TAGACAGTGC ACACCCACTT
TACATTTTGT ATACTTCTGG AACAACTGGA AAACCAAAAG GTGTTTTACA TGGTACTGGC
GGATACTTGA CTCATCTTTA TTCTACTTTC AAATGGGCAT TTGACATTAA AGATTCTGAT
GTGTTTTTTT GTACTGCTGA TATTGGGTGG GTAACTGGAC ACAGCTATGT TGTTTATGCA
CCATTACTAC ATGGCGCAAC TGAAATTATG TATGAAGGCG CACCTGATTT TCCTGACGCA
TCAAGGATGT GGGATATTTT ACAAAAATAC AAAGCCACAA TTTTCTACAC CACCCCAACT
GCTCTTAGAA TGTTTATGAA GTTTGGAGAC GACATTCCAA ATTCCTTTGA TCTTTCTACA
TTACGATTGC TTGGAACAGT TGGCGAACCA ATCAATCCTG AAGTTTGGAG ATGGTATTTC
AAAACCATTG GTAAAGAAAA ATGTCCAATC ATTGATACTT GGTGGCAAAC TGAAACGGGA
GGAATGTTGA TTTCCCCACT TCCTGGCCTT GAAACAATTC CTCTCAAACC TGGCTCTGGA
ACTCTTCCAA TACCTGGTGT GAATATCACT GTTGTAGATG AAAATGGCAA AGATGTTGAG
CCTAATACCA AGGGATATCT TGTTGTCAAG AACCCTTGGC CTGGAATGCT TTTGACATTG
TGGGGTGATG ATGAAAAATA CAAGACAGTA TACTGGTCAA AATACGAAAA TTGCTACTAT
CCTGGTGATT ACGCACTAAA GGATGCAGAT GGATATCTTT GGTTACTTGG ACGTGCTGAT
GATGTTCTAA AAGTTGCAGG TCATAGAATT GGAACTGCAG AACTTGAAAG TTGCATTGTC
TCGCATGATG ATGTTGCTGA GTCTGCTGCA TGTGGTATTC CTGATGAAGT AAAAGGTGAA
GTAATTATTG CATTTGTTGT ACTAAAAGAA GGCATTAACA CTGAAACCAA AGTTTTAGAA
AAAGAACTTG TTGAAAAAAT AAGAACCGAT ATTGGTGCTA TTGCTACTCC AAAACAAATC
TACTTTGTAT CTAAATTGCC AAAGACAAGA AGTGGAAAGA TTATGCGTCG ATTACTAAAA
GCAATTGGAA ATAATGAAAA GATTGGTGAT GTTAGTACTC TGGAAGATGG CGCTGCTGTT
ATTGAAGTTC AAACTGCTTT TGATGAGATT CAAAAATCAA TCAAAGAATC AAACTAG
 
Protein sequence
MSDTYNIGLG NNDTDTRIKA DSDFVSFWND QAKNLTWFSP WNETLDWQPP FAKWFVGGTI 
NASHNALDVH QDSKSEKPAI LWEGENGDSR ILTYGQILTE VQKFSNILKS LGVEKGDRVT
LYLPMIPELP IAMLACARIG ATHTVIFSGF SATSIRDRVD DSKSKVIVTA DGGYRRGKIV
KLKEVVDEAI EDFDFVKNVV VVERTKNEIP MTCKDKLWND LMNDASDNCP AEKLDSAHPL
YILYTSGTTG KPKGVLHGTG GYLTHLYSTF KWAFDIKDSD VFFCTADIGW VTGHSYVVYA
PLLHGATEIM YEGAPDFPDA SRMWDILQKY KATIFYTTPT ALRMFMKFGD DIPNSFDLST
LRLLGTVGEP INPEVWRWYF KTIGKEKCPI IDTWWQTETG GMLISPLPGL ETIPLKPGSG
TLPIPGVNIT VVDENGKDVE PNTKGYLVVK NPWPGMLLTL WGDDEKYKTV YWSKYENCYY
PGDYALKDAD GYLWLLGRAD DVLKVAGHRI GTAELESCIV SHDDVAESAA CGIPDEVKGE
VIIAFVVLKE GINTETKVLE KELVEKIRTD IGAIATPKQI YFVSKLPKTR SGKIMRRLLK
AIGNNEKIGD VSTLEDGAAV IEVQTAFDEI QKSIKESN