Gene Nmul_A0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0641 
Symbol 
ID3785414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp736043 
End bp737203 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID637810723 
Producthypothetical protein 
Protein accessionYP_411340 
Protein GI82701774 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTTATC TGAATATCAT CGCAGCCATG TTGTTTTTTA CCTCAACTGT CCATGCTGCG 
GTAACGGAGG ATGCCTATAT CGCCGGTTAC GCGGCGGGTG TCCTGAAACA TGGCTTTGGA
ATGGAGATTC CGACTCTGGT GGTGAAAGAT GGCATCATTA CCGTACCTGA AGATAAACTG
AAATCCGAGA ACCAAGCGCA GGTTGTTCAG GCACTGTCGA AAATCCCTGG TGTGACCGGA
GTGACCATAG CGGAAAAAAG TGTCAGCAGA ATCGCAGAAC GCGAGGCATT CAAGCCTATT
CAACACCCTT CCGCCATCTC TCGGGAGCCT GGAGCGGCAA CGACCGAGAC CGCAGGCGTA
CCTGCGGCGG GTCCCACCGT TCTGGCAACC GGAATGCTGC CGGAAGGACA TTTGTTCAAG
CCTTTACTGG CCGATCCGCG CTGGGCGCAT TTCTCCGCCG CATATCGCAA CTATGTCGGA
AACAATATTG ACGGAAATAA CAATGCCGCT GTCAGTTTTG GTGAAACCAT TCCCTTCTAC
CGCGCGAATT TCGGACAATC TACTGTGCAG TGGGAAGCAG GTCTTCAGGC TGCTGTCTTC
AGCGACTTCA ATCTCGGCGC GCCTTCGTCC GATCTCATCA ATAGCGATTT CATAGCATCC
GCTTATGGAA GCGTGCGGGC AGGCCATTTT TCTGCTTTCG GCCGTATCTA TCATCAAAGC
TCTCATCTTG GAGACGAATT ATTGCTGCGC AGATTAACCA GCCTGCAGCG GATCAATCTC
AGCTATGAGG GAGCCGATCT CAGATTGTCG TATGAGCTTC CGTATGGATT GAGGGTTTAC
GGGGGTGGAG GTGGAATCTT TCACAAGGAA CCCTCGAACA TCAAGCCCTG GTCGATACAA
TATGGCGTCG AGTTTCGCAG CCCGTGGCGG ATCGCGTTTT TACCGCTGCG ACCGATCGTG
GCGGTTGACC TCAAGAACCA TCAGCAGAAC GACTGGAATG CCGATGTATC CGCGCGGGCA
GGTGTTCAAC TGGATCACTT CCGGGCATTC GGCCGCAATC TTCAGTTCCT GGTTGAGTAT
TTTCACGGAA ACTCCCCGAC GGGCCAGTTT TTCAGGCAGC GGGTGGATTA TCTCGGTATT
GGAGCGCACT ATCATTTCTG A
 
Protein sequence
MRYLNIIAAM LFFTSTVHAA VTEDAYIAGY AAGVLKHGFG MEIPTLVVKD GIITVPEDKL 
KSENQAQVVQ ALSKIPGVTG VTIAEKSVSR IAEREAFKPI QHPSAISREP GAATTETAGV
PAAGPTVLAT GMLPEGHLFK PLLADPRWAH FSAAYRNYVG NNIDGNNNAA VSFGETIPFY
RANFGQSTVQ WEAGLQAAVF SDFNLGAPSS DLINSDFIAS AYGSVRAGHF SAFGRIYHQS
SHLGDELLLR RLTSLQRINL SYEGADLRLS YELPYGLRVY GGGGGIFHKE PSNIKPWSIQ
YGVEFRSPWR IAFLPLRPIV AVDLKNHQQN DWNADVSARA GVQLDHFRAF GRNLQFLVEY
FHGNSPTGQF FRQRVDYLGI GAHYHF