Gene Nmar_0572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0572 
Symbol 
ID5773316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp508749 
End bp510014 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content35% 
IMG OID641316206 
Producthypothetical protein 
Protein accessionYP_001581906 
Protein GI161528080 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1914] Mn2+ and Fe2+ transporters of the NRAMP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCAA AACTTTCTCA TTTTTCAAAA ACAGCAGGAC CTGGAATTTT ATTTGCTTGT 
ACTGCCATAG GCGTCTCACA CTTAATCCAG TCGACTCGTG CTGGCGCTGA TTTTGGTTTA
ATGATCTTGG GGTTTGTTGT GCTTGCAACT GTGCTCAAAT ATCCTTTCTT TGAGTTTGGT
TCTCGTTATG CAAACTCTAC TCAAACAAGT ATCATTGATG GATACAAAAA ACTTGGAAAG
CCAGCACTGT GGATGTATTT TATAATCACA ATCTGTTCAA TGTTTTTTGT AACTGGTGCA
GTAGGATTTG TAACTGCTGG ATTTTTCGAG AATTTGTTTG GAATTGATTT TCTTGAAGAG
TTCACTGTAG TTATTTTATT TGCAATTTGT GTTGGGATTC TAGCTGTTGG AAAATACAAC
GTTCTTGATA GTTTGATTAA GATTATTGCA ATTGTTTTAC TTGTCTCAAC TGTTTCAGCC
TTTTTGTTAG CATTGTATAA TGGTCCAATA GAAGAGGTTT CTGGATTTGC TCCAAAAGAA
CTTTGGAATT TGACTGGAAT CTTTTTCTTA TTAGCATTAA TGGGTTGGAT GCCTGCTCCT
CTTGATCTTT CTAGCTGGAA TAGTTTATGG ACACTTGAAA GATCAAAACA AACTAGTTAT
CGACCAAAAT TAAAAGAAAC ACTGTTAGAA TTCAGATTGG CTTATCTGAT AACTGGTATC
TTGGCAGTCA TGTTTGTTGT ATTAGGTACT TTCATTTTCT ATGGTTCAGG TGAAGAATTA
CCAAACAGTA ATTCTAGTTT TGCTCACAAA GTTGTAACAT TGTATACTCA GACTATTGGT
GAGTGGAGTT ATGTCGTTAT TGCAGCTTCT GCATTTTCTG TAATGTTTGG AACCATAATT
GCGTTGTTTG ATGGGTATTC TCGTTCCTTA CAAAGAACTG TAGAATTGAT TTTTTCTAAA
AAAGAAGAGG CAATACGTAC AAAATTCAAG ACATTTTATG TTATTTTTCT GATCGTTCTG
TCGGTTGGCT CATTGATTGT GATTTTCCAA TTTGCAGGAA ATCTAAAAGA ATTAGTCGAT
TTTGCTACCG TTTTGTCTTT TGTAGTCGCG CCAGTTATTG CGATATTTAA TTTCAGATTG
GTTACTGGAA AGTATCTTCC AAAAGAATCT CAACCTTCAA CTCTGTTGAG AATTTTGAGT
TTTGCAGGAA TTGTATTCCT TAGTGGATTT GCAATATTCT TCTTGGCAAT GAAATTCTTA
TCATAA
 
Protein sequence
MGSKLSHFSK TAGPGILFAC TAIGVSHLIQ STRAGADFGL MILGFVVLAT VLKYPFFEFG 
SRYANSTQTS IIDGYKKLGK PALWMYFIIT ICSMFFVTGA VGFVTAGFFE NLFGIDFLEE
FTVVILFAIC VGILAVGKYN VLDSLIKIIA IVLLVSTVSA FLLALYNGPI EEVSGFAPKE
LWNLTGIFFL LALMGWMPAP LDLSSWNSLW TLERSKQTSY RPKLKETLLE FRLAYLITGI
LAVMFVVLGT FIFYGSGEEL PNSNSSFAHK VVTLYTQTIG EWSYVVIAAS AFSVMFGTII
ALFDGYSRSL QRTVELIFSK KEEAIRTKFK TFYVIFLIVL SVGSLIVIFQ FAGNLKELVD
FATVLSFVVA PVIAIFNFRL VTGKYLPKES QPSTLLRILS FAGIVFLSGF AIFFLAMKFL
S