Gene Nmul_A0634 details


Gene Information

Locus tag: Nmul_A0634
Symbol:
ID: 3785407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 722327
End bp: 723550
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 54%
IMG OID: 637810716
Product: BNR repeat-containing glycosyl hydrolase
Protein accession: YP_411333
Protein GI: 82701767
COG category:
COG ID:
TIGRFAM ID:
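
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the listed values are consistent with that reading) and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, a terminal stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(722327, 723550)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed 1224 bp and 407 aa.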


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGCCC TCCCTGGTTT TGCCGGACTC TTTCTAGCGC TTGTTTCCGT AGCTGTCGTC 
GCAGCAGAAC CGCATTCGAG CGGTGAAACC CATCATCATC CGGCAATGCA TTCCGCAAAA
TCCGTCAAGA CTGCGCTGGC GGTGGGAGTG ACGCTGGATC AGGATGGGCG ATTATGGCTG
GCGAGAGTTG TCGACCAGCA TTTGCTGGTT TCCTGGTCGG AAGATAGCGG AAGCAGTTTT
TCCGAACCCG CGGTCGTAAC ACCTGAACCG GAAAACATCT CCATCGATGG CGAGAATCGC
CCCAAGATCG AGGTTGCGCG TGATGGCAGC GTACTTGTAA CCTGGACGCA GGTTCTTCCG
CAAAAATATT CCGGCAATGT GAGGTTTTCA CGTTCAATCG ACTCGGGCCG GACGTTTTCA
AAACCCATTA CCCTCAATGA CGATGGCCGC GTTACCAGTC ACCGCTTCGA CTCTCTGGCA
ATCGACGGGG ACGGGAGGGT GATAGTTGCT TGGCTGGACG CAAGGGATCG CGATGCAGCA
AGGGAAAAAG GCGAAGAGTA CCGGGGTGTA TCGCTCTATA CCACGCAATC ATTCAACAAT
GGCGAGAGTT TTGGCCGGAA TCGAAGAATC CAGGAGCACA CGTGCGAATG CTGTCGGACG
GCGCTTATCT GGAGCAGGGA GGGGCCAATC GTTTTACTGC GGAATATTTT TGGTGCCAAT
ACCCGTGATT TTGCGCTGAT CAATCTCGAC AAGGGCGGCA TAAGAAGGGT AAACCGTGAC
GAATGGCAGG TCGATGCGTG TCCGCACAAT GGAGGAAGCC TTGCAACGGA CCGAAGGGGC
CAGTTGCATC TCGTCTGGTT CACAAATGGC CCGGCAGATC AGGGATTATT CTATAAGCGG
ATCGATGGCA ATTGGGAATC GAAACCCAAG CCGATAGGCA ATGCGGACGC GCAGGCAAAT
CATGCTTCCG TGGTTGCCGA TGGAGAAACC GTCATTCTTA CCTGGCGTGA ATTCGATGGA
AATGCTTATT CCGCAAAGAT GATGTACTCG AATGATAGTG GCGAATCTTG GAGTGAACCG
ATGCGCCTGA TGGAATCCGA TGGCGCGACA GACTACCCTA TCCCGCTGAT CGATAACAGG
AAAGTTCTGA TCGTCTGGAA TACTGCAAAG GAAGGCCTGC GTATTTTACC GATCGAGCGG
GTGACCGCCC GGTATTCCGG TTAG
 
Protein sequence
MFALPGFAGL FLALVSVAVV AAEPHSSGET HHHPAMHSAK SVKTALAVGV TLDQDGRLWL 
ARVVDQHLLV SWSEDSGSSF SEPAVVTPEP ENISIDGENR PKIEVARDGS VLVTWTQVLP
QKYSGNVRFS RSIDSGRTFS KPITLNDDGR VTSHRFDSLA IDGDGRVIVA WLDARDRDAA
REKGEEYRGV SLYTTQSFNN GESFGRNRRI QEHTCECCRT ALIWSREGPI VLLRNIFGAN
TRDFALINLD KGGIRRVNRD EWQVDACPHN GGSLATDRRG QLHLVWFTNG PADQGLFYKR
IDGNWESKPK PIGNADAQAN HASVVADGET VILTWREFDG NAYSAKMMYS NDSGESWSEP
MRLMESDGAT DYPIPLIDNR KVLIVWNTAK EGLRILPIER VTARYSG
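
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool that produced this record: it builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The helper names `translate` and `gc_content` are illustrative.

```python
from itertools import product

# Codon -> amino acid map in TCAG order; '*' marks a stop codon.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGTTTGCCC TCCCTGGTTT TGCCGGACTC TTTCTAGCGC TTGTTTCCGT AGCTGTCGTC"
print(translate(first_line))
```

Translating the first line of the gene sequence gives MFALPGFAGLFLALVSVAVV, which matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the listed GC content.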