Gene Nmar_0603 details

Gene Information

Locus tag: Nmar_0603
Symbol:
ID: 5774118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 537875
End bp: 539041
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 32%
IMG OID: 641316238
Product: hypothetical protein
Protein accession: YP_001581937
Protein GI: 161528111
COG category:
COG ID:
TIGRFAM ID:
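
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 537875
end_bp = 539041


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1167 388"
```

Under those assumptions the result agrees with the listed Gene Length (1167 bp) and Protein Length (388 aa).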


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCAT TCTTGCTAAA GAAAAAACGC CTAAAAACAT GGGATGAAAA ACTATCCAAC 
AAATCCCAGA GGACAAAAGA TGGTTATACT TTAGTTATCA AATCCTTTGA AAAATTCTGC
AGCGAGTATT ATGGAGGTAG AACAAAAGAC GATATCTTTG ATGAATTATC TGTTCTAAAA
GATGCTGAAA AGACTCTTGC TACTGTAGAT TTGATTCAAA ATTGGATTAA TTGGCATTAT
TCTCATGGTG TAAAAACATC CGTTGTAAAG TTGTATCTTG CATGGCTAGG AAAATACTTT
GATTATAGAG AGATTTCAAT AACTCAAAAG ATAAAGGATG AACTTGACTT TAAACGCGAT
CTAAAAGACG AACCCTTTGC ACTTGAAATT CAACATATTC AGAATATTTT CAAATTTGCT
AGTCCTAAAA AGATTGGATT CTATCTCGCA CTAGTCTCTA CTGGCGCAAG ACCTGCTGAA
CTATTACAGG TAAAAAAGCG TGACATTATC ACATCTACAA AAAGACTCAA GGTATTGATT
CAACCTGAAG GTGTAAAGAC TCGACATGGA CGTTCAGCAT ATCTTACAAA AGAAGCTGCA
CGATACTGTT TGATGAGATT ACGTCAAATT AGTGATGATG ATCTAGTCTG GGGTAAACAT
GAAGATTATA GCAAAACAGA AAAAGCAGAA TCAAAGACAT TTTCAAGATA TTGTGATAAT
GCAGGCTATG TTGAACGATA TCATTCTAAT AATTATAGAA AAATCACCCT CTATTCTTTT
AGGTCCTTTT TCTTTAGTGC TGCAGCAGAC GTAAATCGTG AAGGATATGC ACACAAAATG
ACTGGTCATG GGGGATATCT GTCTCAATAT GACCGAATGT CTGATGAAAA GAAACTTGAA
TGGTTTTTGA AAGTAGAGCC ATTTTTGACT ATAGATGATG ATGAAAGATT ACAACTTGAA
AATAAACAAC TAAAGAAGGA AAATACAGAG AAAAAACAAT TCGAAGAAGA AATCAAAAAT
TTAAAGAAAA GACAAGTAGA GCTTGAATAT AATCAAAAAG AATACGAATC AATCAAACCT
GATGTAGAGA AACTTGTTTT AAATTATTTT GAAGAACTTG GAGAAGATTT TTTCAGAAAA
GTATTTTCAA AAAATAGCAT AAATTAA
 
Protein sequence
MSSFLLKKKR LKTWDEKLSN KSQRTKDGYT LVIKSFEKFC SEYYGGRTKD DIFDELSVLK 
DAEKTLATVD LIQNWINWHY SHGVKTSVVK LYLAWLGKYF DYREISITQK IKDELDFKRD
LKDEPFALEI QHIQNIFKFA SPKKIGFYLA LVSTGARPAE LLQVKKRDII TSTKRLKVLI
QPEGVKTRHG RSAYLTKEAA RYCLMRLRQI SDDDLVWGKH EDYSKTEKAE SKTFSRYCDN
AGYVERYHSN NYRKITLYSF RSFFFSAAAD VNREGYAHKM TGHGGYLSQY DRMSDEKKLE
WFLKVEPFLT IDDDERLQLE NKQLKKENTE KKQFEEEIKN LKKRQVELEY NQKEYESIKP
DVEKLVLNYF EELGEDFFRK VFSKNSIN