Gene Nmar_1561 details

Gene Information

Locus tag: Nmar_1561
Symbol:
ID: 5774697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1429128
End bp: 1430219
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 34%
IMG OID: 641317213
Product: glucose sorbosone dehydrogenase
Protein accession: YP_001582895
Protein GI: 161529069
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
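
The coordinate and length fields in this record agree with one another, and a few lines of arithmetic can confirm that. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS length includes a single untranslated stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1429128
end_bp = 1430219

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1092 363
```

Under those assumptions the computed values match the reported Gene Length (1092 bp) and Protein Length (363 aa).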


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAAA GAATCTCAAT TGTAGCTGTT GTTGTTGCAA TTGCTTTTTC TGTATACACC 
TTAACACTTC CATCAGATCC TATTCCATTA CCAGAACCTA ACTTTAATTC TAAAAATGAC
TCCTTTGATA TTTTAGCAGA GAATCTAGAA AAGCCTCGTT CAATTGCAGT ATCTGATAAT
CGAATTTTTG TAACAGAAAA AGATGGTTCT ATTGTAGTAA TTGAAAACGA TATTCAACTA
GAATCCCCAC TTGCTACTTT TCGTTCTGCT AATGTTTTTG ATGGTGGATT GTTAGGAATT
GATTTACATC CAAATTTTGC AGAGAACCAT TACCTCTATG TATTTTTGAC TTATGAAGAA
GATGAAGCAT TGTGGAATAA AATACTACGA ATAACCGAAT CTGAAAATAA ATTACAAGAT
GCAGAAACTA TTTTTGATAA AATTCCAGGG TCTTCTTTTA CAAATGGTGG TTTTATCAAA
TTTGGACCTG ATGGAAAGTT GTATGTAGGT ACAGGTGCTA CATCTGATTC ATCTCATTTA
CCGCAAGATC TTGATTCACT TTCAGGTAAA ATTTTACGAA TAAACGATGA TGGTTCAATT
CCTGACGATA ATCCTTTTTC AAATTCTCCT GTATACTCTT TAGGACACAG GAATCCTCAA
GGAATGACTT GGGATAATAA TGGTAACTTG TATGTATCAG AATTTGGACC TGAAAAAAAT
GATGAAATCA ATATAATTTT GGCAGGTAAG AATTATGGTT GGCCAGAACA AGAATGCTCT
GGCAATGAAA GTTTTGAAAA TGCTGTTCTT TGTTATGATC CAAGCATAGA GCCTGGAGGA
ATCTTGTACT ATACTGGTGA CAAATTCGAT TTTGAATTCC CTTTCATTAT GGCTTCAATG
AGGGCATCAA ATGTCTATCA AGTAGATTTT GATGAGGGAT TGAGTTCTCA AAAATCTATT
CTTAGTGGAA TTGGACGTGT TCGTGACGTG GTTCAAGGTC CTGATGGATA TCTCTATGTG
ATTACTTCTA ACACTGATGG AAAAGGTTTT CCAGCTGCTA ATGATGATAA ATTATTGAGG
ATATTGAAAT AA
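
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any calculation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_percent`) strip the whitespace and compute a GC percentage. Only the first displayed line is used as sample input, so the value it prints describes those 60 bases and is not expected to equal the 34% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGATAAAA GAATCTCAAT TGTAGCTGTT GTTGTTGCAA TTGCTTTTTC TGTATACACC"
seq = clean_sequence(first_line)

print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete sequence through `clean_sequence` first would allow the same function to be compared against the GC content field.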
 
Protein sequence
MDKRISIVAV VVAIAFSVYT LTLPSDPIPL PEPNFNSKND SFDILAENLE KPRSIAVSDN 
RIFVTEKDGS IVVIENDIQL ESPLATFRSA NVFDGGLLGI DLHPNFAENH YLYVFLTYEE
DEALWNKILR ITESENKLQD AETIFDKIPG SSFTNGGFIK FGPDGKLYVG TGATSDSSHL
PQDLDSLSGK ILRINDDGSI PDDNPFSNSP VYSLGHRNPQ GMTWDNNGNL YVSEFGPEKN
DEINIILAGK NYGWPEQECS GNESFENAVL CYDPSIEPGG ILYYTGDKFD FEFPFIMASM
RASNVYQVDF DEGLSSQKSI LSGIGRVRDV VQGPDGYLYV ITSNTDGKGF PAANDDKLLR
ILK
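
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration, not the method used to generate this record: it builds the standard codon table in pure Python (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons), and the function name `translate` is hypothetical. It is applied to the first 30 bases and the last 12 bases of the gene sequence.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate seq codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATAAAAGAATCTCAATTGTAGCTGTT"))  # prints: MDKRISIVAV
print(translate("ATATTGAAATAA"))  # prints: ILK
```

Both outputs match the corresponding start and end of the protein sequence listed in this record.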