Gene Nmar_1548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1548 
Symbol 
ID5773854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1417196 
End bp1418527 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content34% 
IMG OID641317200 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001582882 
Protein GI161529056 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
[COG1977] Molybdopterin converting factor, small subunit 
TIGRFAM ID[TIGR03603] bacteriocin biosynthesis cyclodehydratase, SagC family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.47369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATA TCACATTTAC AATTCCATCT GTGTTAAATC ACGGTGGAGG AGAAAAAAAG 
ATAGAGATTC CAGCTGATTC CTTACAAGAT GTATTTACAA AAATCTCTGA ACAAATGGGT
GATGATTTTA AGAGACGCGT GTTGGAAGGT GATGGAACTC CTCGTTCATT GATTAATATT
TACATTAATG GTAAAAATGC TAAATTTTCT TCTGGAATGG AAACTGCTCT AAAAGATGGA
GATGAAATCT ATATTTTACC TGCAGTTGCT GGTGGCTCTG AAGAACTTTC TCCAAAAGAA
CTTGACAAAT TTTCAAGACA AGTTATGCTT GAAGAGATTG GATATGGTGG ACAATTAAAA
TTAAAAAATG CTAAAGTGTG TGTTGTTGGA ACTGGAGGTT TAGGACATCC AATAATTTCT
AGACTAGCTA CAATGGGAGT TGGAAATTTA CGAATTATTG ATAGGGATGT AATTGAACTA
TCAAACTTAC ATAGACAAAT AATGTTTGAT GAAGACGATG TTGGTCAGGT TAAAGTAGAA
GTAGCTGCAA AAAAATTACA GAAACTAAAC CCTGATTGTA AAATTGAAGC TCTAGCTGTT
TCAATAAATG ATTATACTGC ATTAGAAGTT GTTGAAGGAT GTGATGTTGT AATTGATGCA
CTTGATAGTG TTAATGCTAG ATATGCACTA AACAAAGCAT GTGTAAAATA CAATATTCCA
TTTGTTACTG GCGCTGCAGT TGGAACATCT GGACAAGCAT TTACTGTATT GCCAAAAGAA
AGCGCATGTT ACTTTTGCAT GTTTCCTGAA TTAAACGAAG ATACAATGCC AACATGTAGT
ATTGAAGGTG TTCACCCCCC TATACTTTCT ATTGTTGGTG CAATTGAAGT TGCAGAAGCT
GTAAAAATAA TTCTTGGAAA AAAACCAAAT CTATCTGAAA GAATTTTACA TATTGATTTA
GAAAGTCTTG ATTTCAATAG TACTAGAACA TTCAGAGCTG ACGAATGTCC AATTTGTGGA
ACAGGGAAAC TTGAAGTTGT ACAAAAAGAA GAATTAATTT TAGAAGAATT GTGTGGAAGA
AATAGAGGAA AGAGAACTTA CTCTATTACT CCAACTGATA CTTTTGAACT TGATGTTGAT
GCAGTTACTA ACATTGCAAA ACAAAAAGGA TTTCTTGTAG ATAACCAGGG TGACTTGGGG
TTGTCAATGC GAACAAATGA TTTGTCTGTA AGTTTTATGA AAAAGGGGTC TGCAGTAGTT
GTTGGACCAA AAGATGAGGA TGATGCAATT TCTTTGTATA ATTGCCTCTT AGGTAAAGAG
ATCAAGGCAT AA
 
Protein sequence
MANITFTIPS VLNHGGGEKK IEIPADSLQD VFTKISEQMG DDFKRRVLEG DGTPRSLINI 
YINGKNAKFS SGMETALKDG DEIYILPAVA GGSEELSPKE LDKFSRQVML EEIGYGGQLK
LKNAKVCVVG TGGLGHPIIS RLATMGVGNL RIIDRDVIEL SNLHRQIMFD EDDVGQVKVE
VAAKKLQKLN PDCKIEALAV SINDYTALEV VEGCDVVIDA LDSVNARYAL NKACVKYNIP
FVTGAAVGTS GQAFTVLPKE SACYFCMFPE LNEDTMPTCS IEGVHPPILS IVGAIEVAEA
VKIILGKKPN LSERILHIDL ESLDFNSTRT FRADECPICG TGKLEVVQKE ELILEELCGR
NRGKRTYSIT PTDTFELDVD AVTNIAKQKG FLVDNQGDLG LSMRTNDLSV SFMKKGSAVV
VGPKDEDDAI SLYNCLLGKE IKA