Gene Nmul_A1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1822 
Symbol 
ID3784917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2080040 
End bp2081326 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content58% 
IMG OID637811909 
ProductPepSY-associated TM helix 
Protein accessionYP_412511 
Protein GI82702945 
COG category[S] Function unknown 
COG ID[COG3182] Uncharacterized iron-regulated membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.646512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGAG TGATCACGGT GCCGCCGGGC GTAGCCAGGG CACAGGATAC CCGGCGCGTG 
GAGAAACTTT CGGTCGGGCG CAACTTTCTA GTACTGGCGC ACCGCTGGGC AGGTCTTCTA
TTGGCCGCGT TTCTTTTCGT ATCCGGCCTT ACCGGCGCGG TCATCTCCTG GGATCATGAG
CTGGATGAGT GGCTGAACCC CCGGCTCTTC CAGGCCAGGA ATACTGGCGG CATGCCGCAG
CCTCCACTGC TGCTGGCCGA CCGGCTGGAA GCGGCGGACC CCCGGTTAAT GGTGACCTGG
CTTCCACTCT CCGTCGAGCC GGGCCATAAC CTTGGGCTGG CGGTGAAGTC CCGTCTCGAC
CCGGCAACAG GCATGGCCTT CAATCTGGAT TTCAACCAGA TAGCCCTCGA TCCGGTTGAC
GGGGAAGTGC GTGGCAAGCG CATGTGGGGT GAAATTTCGC TCAGCCGTGA GAACCTGTTG
CCGTTTCTGT ATAAGCTGCA TTACAGCATG CATATTCCGG ATGGGTTTGG AATCGAGCTG
GGAATCCTGT TCATGGGGAC TCTCGCGATT ATCTGGGCAC TCGATTGCTT CATTGCTCTG
TGGATTTCAT TTCCCAAGGC GAGTGCATGG ACCAAATCCT TTGTATTCCG CTGGCGGCAG
GGAGGAGCAA GGCTGAACTT CGATCTGCAT CGATCCGGTG GGGTGTGGGT ATGGGGATTC
CTGCTGGTTC TGGCGGTGAC CGCCGTGTCG ATGAATCTCA ACCAGCAGGT CATGCGGCCG
CTGGTGTCGC TGTTTTCGAC GCTGTCGCCC AGCCCCTTTA CACGTACTCC CAATCCTCCC
GACCAGCCTA TCGAGCCGAT GGTGGATCGC CACACCATCC TGCAGTATGC GATAACCGAA
GCGGAAAAGC GTGAATGGAG CACGCCTCCC GGCGGCATAT TTTATGATCC CGAGGTAGGT
GTTTATGGTG TCATCTTCTT CGAACCGGGG AACGACCATG GCGATGCGGG GCTGGGGAAC
CCCTCGCTTT TCTTTGACGG CAAGGATGGA ACATCCGTCG GAGCGAATGT GCCGGGTGAG
GGCAGTGCGG GTGATATTTT CATGCAGGCG CAGTTTCCGC TGCATTCCGG ACGTATCGTC
GGGCTTCCCG GGCGCATTTT CGTATCCCTC ATGGGCCTGC TGGTGGCGAT GCTGTCAGTT
ACGGGAGTGA TCATCTGGCA GAAGAAGCGC TGGGCGCGAA AGAAAACTTA TGAAGGGAGC
AGGAGAGATA TAGCTGTATT GTCCTGA
 
Protein sequence
MKRVITVPPG VARAQDTRRV EKLSVGRNFL VLAHRWAGLL LAAFLFVSGL TGAVISWDHE 
LDEWLNPRLF QARNTGGMPQ PPLLLADRLE AADPRLMVTW LPLSVEPGHN LGLAVKSRLD
PATGMAFNLD FNQIALDPVD GEVRGKRMWG EISLSRENLL PFLYKLHYSM HIPDGFGIEL
GILFMGTLAI IWALDCFIAL WISFPKASAW TKSFVFRWRQ GGARLNFDLH RSGGVWVWGF
LLVLAVTAVS MNLNQQVMRP LVSLFSTLSP SPFTRTPNPP DQPIEPMVDR HTILQYAITE
AEKREWSTPP GGIFYDPEVG VYGVIFFEPG NDHGDAGLGN PSLFFDGKDG TSVGANVPGE
GSAGDIFMQA QFPLHSGRIV GLPGRIFVSL MGLLVAMLSV TGVIIWQKKR WARKKTYEGS
RRDIAVLS