Gene Nmar_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1087 
Symbol 
ID5773912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp991041 
End bp992597 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content32% 
IMG OID641316729 
Producthypothetical protein 
Protein accessionYP_001582421 
Protein GI161528595 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAATT TACTTGTTTT ACTTTTTGTT TTAGGTTTCA GTCTTGTTGG CTCTGCTGAA 
CTAGTATTTG GACATGGATT TGGAAGCGAA ACCTTGCCCC CTGCGTCCAT AGGTGATAGA
GATGTCACTT TTTCAATATC TGTATCGCCT TCTATTTTTG ATCCTACTGT AAATGAACAC
TTGATAACCA TGAATCTTTT TGATTCAAAA ACAGAGGCAG CAATTGAACA TGTGACATTT
GAAGTAGAAT TTTTAAAAAA TGACCAACAA CTTTTCAAAG AAGTTTTTCA TGATGAAACT
GGCACTCTGA AATTATCTGT AATATCTGAT GATTCTGATG AGATTTCCAT TCAAGGAACT
CAAGAATCTG TTTTAGGAGG ATGGCTAGTA GATGAACAAC ATCCATTAAC ATTTACTGGA
CCTGTTTTCA CTTCTGGAGG CCTTTATGAG TACAAAGTAA AAATTTTGTC TATCGATTCT
GATTCAAATG TTTTGAAAAA ACCTATTGAA TTTGAAGGTG GCATAAGTAT TGCAGACCAC
CAATATTTCA ATGTGGATGA TCATTTGAAA CAATCTCAAA AACTACATGT TGTTTCTTAT
TTTGATCAAA TTCAAGATTT TAATTTTGAC TCAAACCAAC TAACTTTTAC CATTCCTTTT
GATTGGAATC AAAATTTTCA AGAGATAAGT GTGATTCATC AAGAAGTACG AATTCCAAAT
ACGTTTAGTG ATTTTCTTTC TACAAATTAT GATTCATATG TAAATGGACT TTTACTGCCT
CACGACATTA CAACTATTGA TGATTATTCT TTTGATGATC GAACTGTACA CACTGTGATT
ACTCGTGATT TGCTCAAGTT GTTAAAGAAT AATGTGCAAA CATCTGATGA AATAATTGAG
TTTAAACTAC AACCAAATGA CAAAGTTGAT TTTCCATTAG ATTTTACAAC CTCTGATTAT
CAAGTATTCT TGTATTGGGA ACCTGAAATT ATTCATGCAG GTGAGGATGT GACATTTTTC
ATAGATTTTC AGCAAATCTT TTCAGACCAT CATAAACACC ATGTGGAATA TGATTTTTCA
GTAATCCAAC AGGGTAAGAC CATTTATCAA AATCATTTCA AAGGTGATAT TGATTCTGAT
TATTCAAATA TCCATCAAGT AAACTTTGAT TCAAAATATT CTGGCTCAGC AAATCTTGTT
GTATCTAACA TTGATGGTGA TTCTGAATCA AAAGGAAATT TTATTATCGT AATTGAGCCT
GGCATATCTG CAAATTCTGA GACAAACGAA ATTCCATCTT GGGTAAAAAG TAATGCTGGT
TGGTGGGCAG ATGGGTCAAT TGATGATGAT TCTTTCATTC AAGGAATTCA GTTTTTAATT
GATGAAAACA TTATTCAAAT TCCTCCTACT TTGACTGGTT CAAATTCTCA AACAAACGAA
GTTCCTGTAT GGGTTAAAGT CAATGCTGGT TGGTGGGCTG ATGGCACAAT TGATGATGAT
GCCTTTGTAC AAGGAATGCA ATTTTTGATA AAATCTGGAA TAATTTCTGT AAACTAA
 
Protein sequence
MKNLLVLLFV LGFSLVGSAE LVFGHGFGSE TLPPASIGDR DVTFSISVSP SIFDPTVNEH 
LITMNLFDSK TEAAIEHVTF EVEFLKNDQQ LFKEVFHDET GTLKLSVISD DSDEISIQGT
QESVLGGWLV DEQHPLTFTG PVFTSGGLYE YKVKILSIDS DSNVLKKPIE FEGGISIADH
QYFNVDDHLK QSQKLHVVSY FDQIQDFNFD SNQLTFTIPF DWNQNFQEIS VIHQEVRIPN
TFSDFLSTNY DSYVNGLLLP HDITTIDDYS FDDRTVHTVI TRDLLKLLKN NVQTSDEIIE
FKLQPNDKVD FPLDFTTSDY QVFLYWEPEI IHAGEDVTFF IDFQQIFSDH HKHHVEYDFS
VIQQGKTIYQ NHFKGDIDSD YSNIHQVNFD SKYSGSANLV VSNIDGDSES KGNFIIVIEP
GISANSETNE IPSWVKSNAG WWADGSIDDD SFIQGIQFLI DENIIQIPPT LTGSNSQTNE
VPVWVKVNAG WWADGTIDDD AFVQGMQFLI KSGIISVN