Gene Nmar_1557 details


Gene Information

Locus tag: Nmar_1557
Symbol:
ID: 5774360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1424527
End bp: 1425597
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 32%
IMG OID: 641317209
Product: class I/II aminotransferase
Protein accession: YP_001582891
Protein GI: 161529065
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
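
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1424527, 1425597)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.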


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATT CGTGGTATGA AAAAAAATTA GAAGACTTTG CAAAACTAGG TGGTTACAAA 
AAACCTGAAA AATTTGATGA TGTTTTAAAA CTGGATTCAA ATGAGAATTT TGTAATAAGC
AAAAAATTCC AACAAGATGT TATTGCTTAT GCAAAATCAA ATTCTGATGT AAGAGAATAT
CCATTAGGAG GAGTTGAAAA ATTAGTATCA AAACTAGCAA AATATCTCAA AGTTTCTGAA
AATATGATTG GTGTTGGAAA TGGTTCAGAC CAAATCTTGG ATCTGTTTTT AGCAAATATG
GCCTCAAAGA AAACTAGGAT TTTAACATCT GATCCAACAT TTGGGTTCTT TGAAGAACGA
TGCAAACTAT ATGCTATTCC TTGTACTAAA ATTCCATTTT CATCTGACAT GAAACTTGAT
ATTGAAAAAT TCAATTCAAA CTTGAAAAAA TGTCACATTC TTTATTTGGA TTCTCCAAAC
AATCCTACTG GATTTCAATT CTCAAAAGCT CAACTAGAAT CTTTAATCAA AAAATTTGAT
GGATTGGTAA TCATTGATGA GGCATATGGT GAATTTGGTG ACTCATCAAT TGTTTCATTG
ACAAAAAAAT ATGATAACCT AATTGTTGTC AAAACATTTT CCAAAGCATT TGGACTTGCA
GGTTTACGTA TAGGATACTT TGTTGCAAAC AAGAAAATAG TTGAAGTCTT TAACCAAGTA
TTACAATATC CGTATCCTCT CAATACATTG GCAATCGAGG CTGGAATTGC CTCTTTGGAC
AAAGTTGATC AAATGAAAGA GGCATCTGAA ATCATCAAAT CTGAGCGTAA AAAAATTATT
GAGAATTTGC GCAAGTATGA TGCCTTTACT GTTTTTGATT CCAATGCTAA TTTTGTACTA
TTTGATGCTA AAGGCGCAGA CAAACGCGTA TTCTCTGCAC TAGTAGAACA AGGGATTTCT
ATACGCAAAC TAGGTAAGAT TGGTTCACAT TCAGGATGCT TGAGAGTTAC TGTTGGAACC
AAGGAAATGA ATTCAAAATT CCTTTTAGCA ATACGTGATC TTTTAGGATA A
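
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or analyzed. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first line of the sequence is used as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a demonstration input.
first_line = "ATGAAAAATT CGTGGTATGA AAAAAAATTA GAAGACTTTG CAAAACTAGG TGGTTACAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all 18 lines of the gene sequence gives a length and GC percentage that can be compared against the Gene Length and GC content fields.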
 
Protein sequence
MKNSWYEKKL EDFAKLGGYK KPEKFDDVLK LDSNENFVIS KKFQQDVIAY AKSNSDVREY 
PLGGVEKLVS KLAKYLKVSE NMIGVGNGSD QILDLFLANM ASKKTRILTS DPTFGFFEER
CKLYAIPCTK IPFSSDMKLD IEKFNSNLKK CHILYLDSPN NPTGFQFSKA QLESLIKKFD
GLVIIDEAYG EFGDSSIVSL TKKYDNLIVV KTFSKAFGLA GLRIGYFVAN KKIVEVFNQV
LQYPYPLNTL AIEAGIASLD KVDQMKEASE IIKSERKKII ENLRKYDAFT VFDSNANFVL
FDAKGADKRV FSALVEQGIS IRKLGKIGSH SGCLRVTVGT KEMNSKFLLA IRDLLG