Gene Nmar_1286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1286 
Symbol 
ID5773221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1178045 
End bp1179244 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content37% 
IMG OID641316930 
Productargininosuccinate synthase 
Protein accessionYP_001582620 
Protein GI161528794 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0228509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAA AAGGAATTCT TGCATTCTCT GGAGGACTAG ATACTTCAGT TGTTGTAAAG 
TATCTACAAG ACGAACATGA CATGGATGTC ATCACAGTTA CAGTAGATGT TGGACAAGGT
GATGACAATA AAAAAATTGC AGCAAAAGCA AAAAAGCTTG GAGTAAAAAA ACACTATAAC
ATTGATGCAA GAAAAGAGTT TGTCAAAGAT TACATTTTTC CATCAATTAA AGCAAATGCA
CTTTATCAAA AGAAATATTG TCTTGCAACA GCACTTGCAA GACCACTAAT TGCTGAAAAA
GTTTTAGAGA TTGCAAAAAA AGAAAAAGTT ACATCTTTAG CTCATGGTTG TTCAGGAAAA
GGAAACGACC AAGTACGTTT TGACATTACA CTACGTTCTG GTTCTAACTT ACCAATCATT
GCACCAATTC GTGACAAAAA CTTGGACAGA GTTACAGAAT TAAAGTTTGC AAAGAAACAC
GGAATAGAGA TTGATACTGT AGCAAAAAAG TTTAGTATTG ATCAAAACTT GTGGGGCCGT
GCAATCGAAG GCGGTGTACT AGAAGATCCA TACAACGAAC CACCTGATGA TGCATTCATT
TGGGTAAAGA CAAAAAATTT GCCAGATAAA CCAACATACT TGGAGATCAA ATTCGAAAAA
GGAATTCCTG TTGGGGTTGA TGGAAAAACA ATGGATGGTC AAAAACTAAT AGAGTACATC
AACAAAAAGG CAGGAGATGC AGGAGTTGGA ATTGTTGATC ATATTGAAGA CAGGGTTGTC
GGAATAAAGT CGAGAGAGGT CTATGAGACA CCAGCTGCTA CTTGTTTGAT TGAAGCCCAT
TCAGACTTGG AAAAGATGGT CCACACAAAA CACGAAAACA AGTTCAAATC AATAATTGAT
GATGAGTGGG CATACCTAGC ATATTCAGGA TTGTGGCAAG ACCCACTAAA GTCAGATTTA
GACGGATTTA TCGAGGCATC CCAAAAGCCA GTTACAGGAA CAGTCAAGCT AAAGTTGTAC
AAAGGAAGTC TAAGAGTGGT TGGAAGAAAG TCCAAGAACT CACTTTACAG CCATGAGATT
GCCACATATG GTACAGAATC CACATTTGAC CAAAGATTGG CCAAAGGATT TGTAGAATTG
TGGGGAATGC AATCAACAGA AGCTAATAAA TTACAAAAGA AAAGGTCAAC AAAAATATGA
 
Protein sequence
MTQKGILAFS GGLDTSVVVK YLQDEHDMDV ITVTVDVGQG DDNKKIAAKA KKLGVKKHYN 
IDARKEFVKD YIFPSIKANA LYQKKYCLAT ALARPLIAEK VLEIAKKEKV TSLAHGCSGK
GNDQVRFDIT LRSGSNLPII APIRDKNLDR VTELKFAKKH GIEIDTVAKK FSIDQNLWGR
AIEGGVLEDP YNEPPDDAFI WVKTKNLPDK PTYLEIKFEK GIPVGVDGKT MDGQKLIEYI
NKKAGDAGVG IVDHIEDRVV GIKSREVYET PAATCLIEAH SDLEKMVHTK HENKFKSIID
DEWAYLAYSG LWQDPLKSDL DGFIEASQKP VTGTVKLKLY KGSLRVVGRK SKNSLYSHEI
ATYGTESTFD QRLAKGFVEL WGMQSTEANK LQKKRSTKI