Gene Nmar_1078 details


Gene Information

Locus tag: Nmar_1078
Symbol:
ID: 5773246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 982395
End bp: 983390
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 36%
IMG OID: 641316720
Product: fructose-bisphosphate aldolase
Protein accession: YP_001582412
Protein GI: 161528586
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1830] DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes
TIGRFAM ID:
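
The start, end, and length fields of this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
print(gene_length(982395, 983390))  # 996
print(protein_length(996))          # 331
```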


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTGGG GATTAAAAAA CAGATTATCT AGTATAATTA AACCACACAA TAACCGCGCA 
CTTATGTTAG CAGTTGATCA TGGATATTTT CTTGGACCAA CTGAGAGATT AGAGAATCCA
AAAAAGGTCA TTGCACCTCT ATTGAAACAC TGTGATTCTT TGATGTTAAC TAGAGGTGTT
CAGAGAACAT CTGTTCCTGC AGAAACTGAT ACTCCTATGG TACTTCGTGT ATCTGGTGGT
TCTAGTATTA TTGGTGATGA CTTGTCTCAA GAAGACATTA CAGTATCAAT CCAAGATGCC
ATTAGACTAA ATGCTAGTGC CCTTGCAATG TCTATCTTTG TAGGCTCAAA ATATGAATAT
CAAACAGTTG TTAATCTCGG AAAACTAGTC AGCGAAGCAG AGCAATATGG CATTCCGGTT
TTGGCCGTAA CTGCAGTTGG CAAAGAATTG GGCAAAGATG CAAGATATCT CTCTCTAGCT
TGTAGAATGG CTGCAGAACA AGGCGCACAT ATTGTAAAAA CATACTATTG TGATAATTTT
GAAAAAGTTG TTGAATCTTG TCCTGTACCA ATTATTGTTG CAGGAGGAAA GAAAATCCCA
GAACGTGATG CATTACAATT AACTTACAAT GCTGTCAAGG CAGGTGCTGT TGGTGTTGAT
ATGGGACGAA ACATCTGGCA ATCTGATCAT CCAGTTGCCA TGATTAGAGC AACAAGAGCA
ATTATTCATC AAAATGCAAA TGTTGATCAA GCTTTCAAAC TATACAAAAA ACTTGCAAAC
GAAGATTCAA ACAAGAAACA AAAATCAAAA GGCAAAAAGC CAAACCAAAA CAAATCAAAA
GGAAAGAATC CTAATCAAAA CAAAACCAAA GGCAAAAAGC CAAACCAAAA CAAATCAAAA
GGAAAGAATC CTAATCAAAA CAAAACCAAA GGCAAAAAGC CAAACCAAAA CAAGTCAAAC
AAACCCCAAA ACAAACCTCA ACCAAAAAAG AATTAA
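
The GC content reported for this gene can be recomputed from the sequence block. A minimal sketch follows, assuming the sequence is pasted as a string in the 10-base, space-separated layout shown above; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence above, kept in its original layout.
first_line = "ATGGATTGGG GATTAAAAAA CAGATTATCT AGTATAATTA AACCACACAA TAACCGCGCA"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 996 bp sequence instead of the first line gives the whole-gene value for comparison with the GC content field.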
 
Protein sequence
MDWGLKNRLS SIIKPHNNRA LMLAVDHGYF LGPTERLENP KKVIAPLLKH CDSLMLTRGV 
QRTSVPAETD TPMVLRVSGG SSIIGDDLSQ EDITVSIQDA IRLNASALAM SIFVGSKYEY
QTVVNLGKLV SEAEQYGIPV LAVTAVGKEL GKDARYLSLA CRMAAEQGAH IVKTYYCDNF
EKVVESCPVP IIVAGGKKIP ERDALQLTYN AVKAGAVGVD MGRNIWQSDH PVAMIRATRA
IIHQNANVDQ AFKLYKKLAN EDSNKKQKSK GKKPNQNKSK GKNPNQNKTK GKKPNQNKSK
GKNPNQNKTK GKKPNQNKSN KPQNKPQPKK N
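
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon map from the conventional TCAG ordering; to my knowledge, translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the same map is used here. `translate` and the constants are illustrative names, and the function assumes an unambiguous A/C/G/T sequence in frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]  # KeyError on non-ACGT codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGATTGGG GATTAAAAAA CAGATTATCT"))  # MDWGLKNRLS
```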