Gene Nmul_A0417 details

Gene Information

Locus tag: Nmul_A0417
Symbol:
ID: 3784167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 463465
End bp: 464769
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 56%
IMG OID: 637810493
Product: hypothetical protein
Protein accession: YP_411117
Protein GI: 82701551
COG category:
COG ID:
TIGRFAM ID:
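
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 463465
end_bp = 464769
protein_length_aa = 434

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1305
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 434
```

Under these assumptions the computed values agree with the listed Gene Length (1305 bp) and Protein Length (434 aa).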


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCT GGAGTCTCGA CAGAATTACG CAGGCATTGA GTGGTGCAAT GTGGACGTTT 
TCCTCTCTGC CTGAGGACAA CTCGGCCGCC ACTTTCGTCC ATCGATGTTT CCGGCGGGAA
AGCTGGCGAG GTGCCGGGTT TCGCGAACGC ACCCTCATGT ATGCAGCCTT ACCGTTTGCG
CCATTCGTCA CCCTGGTGTT GGCTATCGCC TTCACTGCGC TTAACGGACA GGCTATAAAG
AAGCGGACCG GAAAAGGGAT AATCCGGCAA ATCCGGGAAC AGTTTGAGAT TGCGCTGCGA
TACGCGATTT TGCCCCCCTG GTATTATATT TTTGAACTGC ATGACGGCGA CAAGCGGCGG
CGCGCTTCTG AATACATCAA CCGCTTTGAA GTAAAAACCT GCCTCTACCG CATCCTCCGT
GACTACAATG GCGGCCTTCC CATCCCTGCA GAGCGCAGTA CCTTCTGCAT CAAGGACAAA
TTATGCTTCC TGTCACGCTG TCGCAGGTTC TCTATTGCCA CAGCTCCTGT GTTTTTGATT
GTCTCGAAGG GGGAGATCAA AGCAATTGAT TGGGGCGGGT CCCTGCTCCC GGAAACCGAT
CTGTTCGTGA AACCGCTCCA GGGGGAGAAC GGAAGGAACG CGGTACGATG GGATTATCTG
GGTTCGGGGC AATATCGGCG CAACGATGGT AAACACGCCA CCGCTCAAGA GGTGCTGGAG
GGGTTATGCA AGGCATCATG GCGCAGGTCT TTCCTGGTGC AGCCCCGGCT TATTAACCAT
AGAGAAATTG CCGATCTTGC AAATGGCACC TTGGCGACGA TACGGGTAAT GAGTTGCCGC
AATGAGCGGG GCGAATTCGA GGCAACCAAT GCGGTTTTTC GAATGGCGCA AAATGAGACC
GTAGTTGTCG ATAACTTTCA CAGAGGCGGA ATCGCAGTCA ATGTCGATCT TCATACCGGC
AAATTGGGAA GGGGCGCCTG CGGAGCGTGG GGATCCACAG GAGGAGGATG GTACGAGCGA
CATGACAAGA CGGGTGTGCA GATTCTGCAC CGCGAGCTTC CGTGCTGGCC CGAGTTGCTC
GCGATGGTTC GATACGCCCA TGGGAGCGCC TTCTCCGACC AGGTGGTAAT CGGCTGGGAT
GTTGCCCTGC TCGACAGCGG GCCGTGCATG GTTGGAATCA ACAAGGCCCC CGATCTGGAC
ATGATCCAGC GGATAAGCCG GCGTCCGCTG GGTAACGAGC GGTTCGGAAA GCTTCTGGCA
TTCAACCTGG AACGCACTGT CGAGGCTGTG CATCAATCTT CTTAA
 
Protein sequence
MSAWSLDRIT QALSGAMWTF SSLPEDNSAA TFVHRCFRRE SWRGAGFRER TLMYAALPFA 
PFVTLVLAIA FTALNGQAIK KRTGKGIIRQ IREQFEIALR YAILPPWYYI FELHDGDKRR
RASEYINRFE VKTCLYRILR DYNGGLPIPA ERSTFCIKDK LCFLSRCRRF SIATAPVFLI
VSKGEIKAID WGGSLLPETD LFVKPLQGEN GRNAVRWDYL GSGQYRRNDG KHATAQEVLE
GLCKASWRRS FLVQPRLINH REIADLANGT LATIRVMSCR NERGEFEATN AVFRMAQNET
VVVDNFHRGG IAVNVDLHTG KLGRGACGAW GSTGGGWYER HDKTGVQILH RELPCWPELL
AMVRYAHGSA FSDQVVIGWD VALLDSGPCM VGINKAPDLD MIQRISRRPL GNERFGKLLA
FNLERTVEAV HQSS
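
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any downstream use. The following is a small sketch of one way to do that and to compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the record's own GC content figure was derived is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGAGTGCCT GGAGTCTCGA CAGAATTACG CAGGCATTGA GTGGTGCAAT GTGGACGTTT "
dna = clean_sequence(first_line)

print(len(dna))  # 60
print(dna[:3])   # ATG

# Toy input to show the GC calculation: 4 of 8 bases are G or C.
print(gc_percent(clean_sequence("GGCC AATT")))  # 50.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content field.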