Gene Nmul_A0959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0959 
Symbol 
ID3785750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1112791 
End bp1114050 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content55% 
IMG OID637811042 
Producthypothetical protein 
Protein accessionYP_411654 
Protein GI82702088 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAAC CCTTTTCTGC CCTGCTTGCC TTGGTCTTCG CCACTGGTCT TGGGGCACCG 
GAGGCTTTTG CACAGTTGCA AACCCGGATC GTTGCAAACG GCCTTGATCG CCCTTTGTTC
GCAACATCGC CAGCAGGAGA CTCACGCCTG TTCATTGTCG AACAGGGCGG GTTGATCAAA
ATCCTCCAGA ATGGGAGTGT GCAGCCAACA CCATTTCTCG ATCTGTCAGG CTCGGTCAAT
ACCGAGGGCG AACGCGGCCT GCTGGGAATG ACATTCGATC CAAACTTTGC CAGCAACCGT
CGATTTTATG TCGATTACAT AGACAGGACT TCGCTCAATA CCGTAGTCGC AACGTATCAG
GTAAGTGCAA CGCAGCCAAA CGTGGCAGAT ATCACCAGCC GACAAACAGT GCTCACTGTG
CAGCAACCTG AGTTCAACAA TCACAAGGCC GGGTGGCTCG GCTTCAGGCC CGGCGAGCCG
GGGAATCTTT ACATAGCCAC AGGAGACGGC GGCCTCCGGG ATGATCCTGG TAACCGCGCG
CAAAACCTGT CCAGCAATCT CGGCAAAATT TTGCGAATCG ATGTTTCATC TGATCGCCTG
CCCAATGATC CCACGCAGTA TGGCTATGCA ATACCGGATG GAAATGCCAC AGGCAGCAAT
CCGGAGATAT ATGCATCTGG ATTGCGTAAC CCGTTTCGTG ACAGTTTTGA CCGGGAAAAC
GGCACTTTCT ACATCGGTGA CGTCGGGCAG AACGCACGCG AGGAAATCGA TATAGGCGCT
GCCGGAGCGA ACTATGGGTG GCGCAGATTC GAGGGAACGC TGGTGAATTT TCCTAACGAT
CCACAGATTC CCAACCACAC GCCGCCCATC TTTGAATATA ACCATACCGC GGATGGCGCC
TCAGTCATTG GCGGCTATGT CTACCGCGGC TCAGAAATCC CCGGCCTGGA AGGCACCTAC
TTCTTTGCGG ACTTCGTCAA CGACAAGGTG ATGTCCTTCC GCTTTACTGG CTCAGGCATT
ACCGACCTCA CCGACCGCAC TGCCGAACTG CTCTCCCCAA CGGGCATTTC GGGGAATATC
ACCTCATTCG GCGAGGATGC TTCCGGCAAC CTTTATCTGG TAAGCCTCAA TGGGCAAGTC
GGACGAATCG CCCTTATACC CGAACCTGCA AGCTACGCAA TGATGCTGGC GGGGATGGGT
TTGATCGGGG TGTGGGTTAG GCGAAGAGGG AAAGCGAGGA AAGTCCCAGG GCTCACCTAA
 
Protein sequence
MIKPFSALLA LVFATGLGAP EAFAQLQTRI VANGLDRPLF ATSPAGDSRL FIVEQGGLIK 
ILQNGSVQPT PFLDLSGSVN TEGERGLLGM TFDPNFASNR RFYVDYIDRT SLNTVVATYQ
VSATQPNVAD ITSRQTVLTV QQPEFNNHKA GWLGFRPGEP GNLYIATGDG GLRDDPGNRA
QNLSSNLGKI LRIDVSSDRL PNDPTQYGYA IPDGNATGSN PEIYASGLRN PFRDSFDREN
GTFYIGDVGQ NAREEIDIGA AGANYGWRRF EGTLVNFPND PQIPNHTPPI FEYNHTADGA
SVIGGYVYRG SEIPGLEGTY FFADFVNDKV MSFRFTGSGI TDLTDRTAEL LSPTGISGNI
TSFGEDASGN LYLVSLNGQV GRIALIPEPA SYAMMLAGMG LIGVWVRRRG KARKVPGLT