Gene Nmar_1638 details


Gene Information

Locus tag: Nmar_1638
Symbol:
ID: 5774751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 1494591
End bp: 1496462
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 35%
IMG OID: 641317292
Product: hypothetical protein
Protein accession: YP_001582972
Protein GI: 161529146
COG category:
COG ID:
TIGRFAM ID:
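
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 1494591
END_BP = 1496462
GENE_LENGTH_BP = 1872
PROTEIN_LENGTH_AA = 623


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1872 bp, which is 624 codons, or 623 residues plus the stop.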


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACATT TATCGAATGA ACCATTCCCA TTATATTCAA AACAATGTTT TAGATTTCAT 
ATGGCAAAAA GTATGGGAAC AATTGCAATT TTCGCATTTA TGGTAATTTT AACATCTACA
ATTACAATAT CCCCAGTCTT AGCTGATACT GGTTTTACAA ATGTACAAAA ATCAGCAGGA
ATTATAATGA AATTCTGTGC AAATGAAACT TTCAAACTAC AAGATTGTAA TGAAAGATAT
GAAGGAATTG GCTGGACTGA TAGAGTAAAC GTCTTGATTT ATGCTCCAGG ATGGAACGAG
GATGATGATA AAATTGAACA AATTGGTACC ACATCAAATC CAATTGATGT CTACACTGAT
GCCAACCGTG TAAATGGCGT TGAGTTTACA GAAACTGGCC CTGATACTGG CATATTCATG
GGAGTTGTAA AATTGACAGG TGCAATGCGT TATACTGTTC ATGATACATT TCTTACTACT
GTTAAAACAC CTGGAATGAC TATGGATCCA GATGGAATGA ATATTTCAGC ACATGATAGG
GCTGTAATGA TTGCAACATC TACACAAGAT GGTAGATTAA CAGTTGACTG GGAATATAAT
GAAGATCAAC ATGTTTACAA AACTGCATAT TATACTTGGC AAATGGGACA AGCTGAATTC
CACAAAGATA CCTATGATGT AAATGAAAAA GTCACATTTT TCATACGTGA TACCGACTTG
TGGAAGCACC ACCGAGAATT TTTCACAAAT TATGTTAAAG TATATTCAGA TTCAGATAAA
GCAGGAATAT TTGTTGGTGT TCAATTTGTA AAAGATATGG ATCATGCAAA AATTCAGAAT
GCAGTATATG ATCGTCACTT GAGTGAACCA GCTGCAAGCT CATTAACAAA ATACACTCCT
GATGGAGAAT GGAAAACATA TCTCTGGACT GAACCAGGTG GTGTAATTGG TGTTGATCAA
GATTATGACT TTAACTTAAT GGTTCATGAT GGCTTAACTG ACATCCACGA GATGGGATTG
TCTTATGATA TGGATATCTA TCTTAACGGT GAATTAATTG AATCAAGAAA TGATCAATAT
TGGGTAGACG GACAAGGTGT AGAACCAATT CGCTTTGATG AGAGAGGCTC TGCTAAAATT
GTAGTTTCTA ACATCTTTGA TCAACCTGGT CAAGAAGTAA ATTTCTCATT CCAAGTTGCA
CCTGAAGCAA TTTTAGAAGA AGTTGTACCT AGACATGGTT CCTTTGAAGT TGGAAGTACT
CCTAATTACT TTGTAGGATA TGAACATCCT CACTATATCA ATTATCTTCC AGGCGAGTTC
TTTATGACCA CTGGAGATTC TTCTCAAGAG CAAAATAGAT TGAGAGTTAC AAATGGTGAT
ACAATTTACA TTGAATATGA AGACATTACA TTACCACGAC CATACACTAC TGCTGATAGT
ATGGAAATAG TTGCAAGAGC ATTAGTTCTT GATACTGGCG TTCATATGGT TTCAGATGAT
TCAGAGATAT TTGTTGAAAC ACCTAGACCT ACAGTAACTC CGGTTTCAAG TGACATTGAT
ATGTCAAAAC CTACAATAAC TTCTGTTTCA ACTGACATTG CAATTCCGGA TTGGGTAAAG
AAGAATGCAA TGTGGTGGTC TGATGGACAA ATCAATGATC CAGACTTTGC AAAAGGTATT
GAGTATCTAG TTCAAGAAAA TATCATTAGT GTATCTGCTG CAGAAGAAAT TGTTGATGAA
GATGTAAACA TAACATCAAT TCCAATGTGG GTAAGAAATA ATGCAGGTTG GTGGTCTGAA
GGTCATCTTA CTGATGTAGA ATTTGCAAAT GGAATCAAAT TCTTGATGGC ATCTGGATTA
ATCAAAGTCT GA
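
The GC content field in the record can be recomputed from a sequence block like the one above. This is a minimal sketch that assumes the sequence is pasted as text in the same 10-base groups, with spaces and line breaks to be ignored. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare against the reported value.
    example = "ATGGTACATT TATCGAATGA"
    print(f"{gc_fraction(example):.0%}")
```

The database presumably rounds to a whole percent, so a recomputed value should be compared after rounding.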
 
Protein sequence
MVHLSNEPFP LYSKQCFRFH MAKSMGTIAI FAFMVILTST ITISPVLADT GFTNVQKSAG 
IIMKFCANET FKLQDCNERY EGIGWTDRVN VLIYAPGWNE DDDKIEQIGT TSNPIDVYTD
ANRVNGVEFT ETGPDTGIFM GVVKLTGAMR YTVHDTFLTT VKTPGMTMDP DGMNISAHDR
AVMIATSTQD GRLTVDWEYN EDQHVYKTAY YTWQMGQAEF HKDTYDVNEK VTFFIRDTDL
WKHHREFFTN YVKVYSDSDK AGIFVGVQFV KDMDHAKIQN AVYDRHLSEP AASSLTKYTP
DGEWKTYLWT EPGGVIGVDQ DYDFNLMVHD GLTDIHEMGL SYDMDIYLNG ELIESRNDQY
WVDGQGVEPI RFDERGSAKI VVSNIFDQPG QEVNFSFQVA PEAILEEVVP RHGSFEVGST
PNYFVGYEHP HYINYLPGEF FMTTGDSSQE QNRLRVTNGD TIYIEYEDIT LPRPYTTADS
MEIVARALVL DTGVHMVSDD SEIFVETPRP TVTPVSSDID MSKPTITSVS TDIAIPDWVK
KNAMWWSDGQ INDPDFAKGI EYLVQENIIS VSAAEEIVDE DVNITSIPMW VRNNAGWWSE
GHLTDVEFAN GIKFLMASGL IKV