Gene Nmul_A2641 details


Gene Information

Locus tag: Nmul_A2641
Symbol: prfA
ID: 3785252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3026322
End bp: 3027398
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 58%
IMG OID: 637812730
Product: peptide chain release factor 1
Protein accession: YP_413320
Protein GI: 82703754
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0216] Protein chain release factor A
TIGRFAM ID: [TIGR00019] peptide chain release factor 1
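
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3026322, 3027398)
print(length)                  # 1077
print(protein_length(length))  # 358
```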


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA GTATGACTGC CAAGCTCACG CAACTCAGCG TGCGCCTGGA GGAGTTGAAT 
CGTCTGCTGA GCAGCGAAAG CATCACGGTC AATCTCGATC AATACCGCAA GCTGACGCGC
GAACGTGCGG AGATTGCCCC CGTGGTCGAC CTGTATAACG CTTATCTGCA AAGCGAGCAG
GATATTCATA CCGCGCAGGA GATGGCTGCC GAGGCGGAAA TGCGCGAGTT TGCCGATGCC
GAGATACGGG ATGCCAAAGA GAGGCTGGTG CGTTATGGCG CGGAATTGCA GAAGCAATTG
CTGCCCAAAG ACCCGAACGA TGAGCGCAAT ATTTTTCTGG AGATCCGGGC CGGTACCGGA
GGAGACGAGT CCGCGCTGTT TGCGGCGGAC CTGTTCCGCA TGTATGCACG CTTTGCTGAA
CGCCAGCGCT GGCAGGTGGA AATCATTTCG CAAAGCCCGT CCGACGTCGG CGGATATAAG
GAAATCATTG CCAAGATCAG CGGTGAAGGC GCCTATTCCA AACTCAAGTT TGAATCGGGT
GGGCACCGGG TGCAGCGCGT GCCAGCGACC GAAACGCAGG GACGTATCCA TACTTCCGCT
TGCACCGTCG CGGTGATGCC GGAGGCAGAC GAGATCGAGG ATGTCGCGCT CAATCCTGCC
GAGCTGAGGA TCGATACTTT CCGTGCTTCC GGGGCGGGAG GCCAGCATAT CAACAAGACT
GATTCTGCCG TGCGCATCAC TCACCTGCCG ACGGGAATCG TGGTCGAATG CCAGGATGGC
CGTTCCCAAC ATAAAAACAA GGCGCAGGCG ATGAGCGTGC TGGCTGCGCG CATCCGCGAC
AAGCAGATGC AGGAGCAGCA AAGCAAACAG GCGGCAACGC GCAAGTCGCT GGTAGGCACG
GGTAATCGTT CAGGACGTAT CCGCACTTAC AATTTTCCCC AGGGGCGGAT AACGGATCAC
CGCATCAATC TGACGCTGTA CAAGATCGAG CAGATCATGG ATGGGGATTT GAACGAGCTT
TGCTCGGCCC TGCTGGCCGA GCATCAGGCC GAGCAGTTGG CGGCAATGGC GGAGTAG
 
Protein sequence
MNKSMTAKLT QLSVRLEELN RLLSSESITV NLDQYRKLTR ERAEIAPVVD LYNAYLQSEQ 
DIHTAQEMAA EAEMREFADA EIRDAKERLV RYGAELQKQL LPKDPNDERN IFLEIRAGTG
GDESALFAAD LFRMYARFAE RQRWQVEIIS QSPSDVGGYK EIIAKISGEG AYSKLKFESG
GHRVQRVPAT ETQGRIHTSA CTVAVMPEAD EIEDVALNPA ELRIDTFRAS GAGGQHINKT
DSAVRITHLP TGIVVECQDG RSQHKNKAQA MSVLAARIRD KQMQEQQSKQ AATRKSLVGT
GNRSGRIRTY NFPQGRITDH RINLTLYKIE QIMDGDLNEL CSALLAEHQA EQLAAMAE
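
The relationship between the gene sequence, the protein sequence, and the Translation table and GC content fields can be sketched in a few lines. This is a hedged illustration, not the database's own pipeline: the helper names `clean`, `translate` and `gc_percent` are hypothetical, and the codon table is the amino-acid assignment of NCBI translation table 11, which is identical to the standard code for elongation codons. Alternative start codons, which table 11 reads as Met at initiation, are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAACAAAA GTATGACTGC CAAGCTCACG"))  # MNKSMTAKLT
```

Passing the full gene sequence to `translate` and `gc_percent` allows the listed protein sequence and GC content to be checked against the record.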