Gene Nmar_1644 details


Gene Information

Locus tag: Nmar_1644
Symbol:
ID: 5772978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1502127
End bp: 1503287
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 33%
IMG OID: 641317298
Product: hypothetical protein
Protein accession: YP_001582978
Protein GI: 161529152
COG category:
COG ID:
TIGRFAM ID:
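
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_len = feature_length_bp(1502127, 1503287)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1161 386
```

Both results agree with the Gene Length and Protein Length fields in the record.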


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGTTTTG CTGAAGTTTA TGTTCCATTA CATGAGTACT TGGGATATTT TGATTCAACT 
GGAATTTACA CTGTAGTTGG TAATGTTAAA AATGAAAATG ATTTTGCAAT AATTCCTACA
ATTACCGTTT CTGTAATTGA AAATTCTGAA ACAATTTCAA AAACTATTCA GCATGTTCCA
CTTGCTGCTG GAACAGAAAT TCCATTTAAG ATAAAATTTC CTGAAGTACA ATCAAACACT
CCAGTTCTAG TTAATCCTGA ATTAATTTAT GAACAAACAA TGACTAATCC AGTTCCAATC
CAAATTCTTT ATGATAAGAC ACTAGTCAAA CATGAAGATG GTCATATATC AGGCAGAATT
CAAAACACTG GAAATGAAAC AATACACTTT CCAAAAATTT TTGCAGTTGT TCATGGATAT
GAAAAAGTTC TAGATATTAC TCAAAATATC GAATATATTG AAAAAATTGA ACCTGGAGAA
ATTCTAGACT TTACAATGTA TCCTGATCCT TCAGTAACTG AGGATATCTT TTACTATAGT
TGCTTTGCAC CAGTTGATAC TACTGTAATT CCTGTGACTG CAAAGAAGAA TGGTGGTGAT
TTTGATTTCA GATACGATTC AGGTGCATGG TATTCAGCTG CAAAATTTGA TGAATCTGGA
ACAACAATGA CAATTAGAGG TTACAATAGT TATCCATTAG AGACATATGC AAACTTTGAA
TTTGCTCCAA TTTCTGGAAA TGAAAAATTT TCTGTCACAC TAAATGACGA ACCTATAGAA
TTTATCCAAA GCATTGATGA TATGGGATTC TGGCATGTTG CATTTACTGT TGAGCCTCAA
TCCCAAGGTG TTTTGAAGAT TTCAGGTTTT GACAAAGGAT TACCTCCTGA ACTTCCTACA
GTTCCTGTTT GGGTAAAACA AAATGCTGAC TGGTGGGCAA CTCAACAAAT TCCTGATTCA
GAATTTTTAG AAGGAATTGA CTTTCTTTTT GAAAAACAAA TCCTATCTGT TCCAACGCGT
GAAGTAGTTT CTGAATCACA ATGGAAGATT CCTCAATGGG TACAAATTCC TGCAGGTTGG
TGGTATGAAG AAAAAATTAC TGATGAACAA TTCTTAAACA TAATTGAGAA TCTAGTACAA
CGAGAAATTA TTGTAGTTTG A
 
Protein sequence
MSFAEVYVPL HEYLGYFDST GIYTVVGNVK NENDFAIIPT ITVSVIENSE TISKTIQHVP 
LAAGTEIPFK IKFPEVQSNT PVLVNPELIY EQTMTNPVPI QILYDKTLVK HEDGHISGRI
QNTGNETIHF PKIFAVVHGY EKVLDITQNI EYIEKIEPGE ILDFTMYPDP SVTEDIFYYS
CFAPVDTTVI PVTAKKNGGD FDFRYDSGAW YSAAKFDESG TTMTIRGYNS YPLETYANFE
FAPISGNEKF SVTLNDEPIE FIQSIDDMGF WHVAFTVEPQ SQGVLKISGF DKGLPPELPT
VPVWVKQNAD WWATQQIPDS EFLEGIDFLF EKQILSVPTR EVVSESQWKI PQWVQIPAGW
WYEEKITDEQ FLNIIENLVQ REIIVV
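
The gene sequence begins with TTG while the protein begins with M. Under translation table 11, TTG is one of the permitted alternative start codons, and an initiator codon is read as methionine. The sketch below illustrates this; it is not the database's own pipeline. The name `translate_cds` is hypothetical, and the codon table is built from the standard amino-acid assignment string, which table 11 shares with the standard code.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons allowed by translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiator codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used in the listing
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in TABLE11_STARTS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # initiator codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_ten = translate_cds("TTGAGTTTTG CTGAAGTTTA TGTTCCATTA")
print(first_ten)  # MSFAEVYVPL
```

The output matches the first ten residues of the protein sequence in the record.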