Gene Nmul_A1940 details

Gene Information

Locus tag: Nmul_A1940
Symbol:
ID: 3784236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2231010
End bp: 2232791
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 57%
IMG OID: 637812026
Product: glycoside hydrolase family protein
Protein accession: YP_412627
Protein GI: 82703061
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
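
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues for a complete CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2231010, 2232791)
residues = protein_length_aa(length)
print(length, residues)  # 1782 593
```

Both results agree with the Gene Length and Protein Length fields of the record.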


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.590979
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGTTCG CGCACCCTCG ACCGCAGATG CAGCGCTCCA ATTGGACCTC GCTCAACGGA 
GTCTGGCGCT TTCGCTACGA CGAGGCGCGT ACTTTTACTC ACCCCTCTCA GATAGAATCC
TGGCCGATGG AAATCATCGT GCCGTTTCCA CCCGAATCCG AGGCAAGCGG AATCGGTGAC
CGCAGTTTTC ATTCACTCTG CTGGTATGAG CGCGATTTTG ATTTCGAGCC CTCCTCGGAG
CGGGTAATCC TGCATTTCGG TGCCGTGGAC TACTCCGCAA AGGTATGGGT CAACGGTCGT
TTTGCCGCCA GCCACGAAGG GGGGCATACT CCATTCTGGG GGGACATCAC CCCTCTGCTC
GATCCATCCG GAAAACAGAA GGTGACGGTA CAGGTGGAGG ACGATCCGCA CGAGCTGGCC
AAACCGCGTG GAAAGCAGGA TTGGCAGCTC GAGCCTCATG CCATCTGGTA TCCCCGTACC
ACTGGCATCT GGCAGATGGT CTGGATCGAG CGGGTATCCG AAAATTACAT TGAAAAGATT
CGGTGGACGC CCCAGGTCGA GATATATGCG ATAGGCTTCG AAGCGCGCGT AATAGGAGAA
GAGGCCGACG AACTGGCAGT AGACGTATGC TTGCGCCACG GGGAGCAGTT GCTCGCGCAT
GACCGCTATC GGGTAGTCGA ACGGGAAGTA GACCGCGTCA TCATATTGTC TGACCCTGGA
ATTGATGATT TTCGTAATGA GTTGCTATGG AGCCCGGAAC GTCCCACCTT GATCGATGCT
GTTGTACGAC TGATGCGGGG CGAGGAGGTG GTCGACGAGT TCATCTCCTA CACTGCAATG
CGTTCCGTCA ACATCCTGCG CGACCGCTTC ATGCTGAACG GTCGTCCCTA CACGCTGAGG
CTCGTGCTTG ACCAGGGCTA CTGGCCGGAG ACGCTGCTGG CTGCGCCAAG CGACGACGCC
CTGCGAGGCG ATGTGGAACT TGCAAAGGCA ATGGGCTTCA ACGGGGTGCG CAAGCATCAA
AAGATAGAGG ACCCGCGCTA TCTTTATTGG GCGGACAGGC TCGGGCTGAT GGTATGGGAA
GAAATGCCTT CCGCATATCG CTTCACCCGC AGCGCCATCA AGAGGATGGT GCGGGAATGG
ACGGAGGCCA TCGAGCGGGA TTACAGCCAT CCCTGCGTCA TTGTATGGGT ACCTTTCAAT
GAATCCTGGG GAGTACCGGA ACTTACCGCG GTCCGTAAAC AGCGGCACGC TGTCGAGGCA
CTATATCACT TGACCAAAAC CCTGGATGCG ACGCGCCCGG TAATCGGCAA CGATGGATGG
GAAAGCAGTG CTACGGATAT CATCGGTATT CACGATTATG ACGCAAACAT TGAACATCTG
CGCCAGCGTT ATGGCGCCGA AATAAAACCT GAACAGTTGT TCGACCGTCG GCGCCCGGGG
GGGCGGATTC TCACCCTTGA TGGCTACCCG CATCGAGGCC AGCCGATCAT GTTAAGCGAA
TTCGGGGGGA TTGCTTTCGC CAAGTGCCCG CAACCCGGCG TCGAGCATAC TTGGGGCTAT
ACTGTTGCCC ACGCCGAAGA GGAATTTGCG CGTATGTATG CCGAGTTGAT GCATACAGTG
ATTCATACGG CTCTCTTCAG CGGCTTTTGC TATACCCAGT TTGCCGATAC CTTTCAGGAA
GCGAACGGAC TGCTGTGCGC GGATCGTACT CCCAAGATTC CCATTGAGCA AATCGCCCGC
GTCACGCGCA TCTCGCCCAC CTATATACCC GGGGGTGTTT AG
 
Protein sequence
MKFAHPRPQM QRSNWTSLNG VWRFRYDEAR TFTHPSQIES WPMEIIVPFP PESEASGIGD 
RSFHSLCWYE RDFDFEPSSE RVILHFGAVD YSAKVWVNGR FAASHEGGHT PFWGDITPLL
DPSGKQKVTV QVEDDPHELA KPRGKQDWQL EPHAIWYPRT TGIWQMVWIE RVSENYIEKI
RWTPQVEIYA IGFEARVIGE EADELAVDVC LRHGEQLLAH DRYRVVEREV DRVIILSDPG
IDDFRNELLW SPERPTLIDA VVRLMRGEEV VDEFISYTAM RSVNILRDRF MLNGRPYTLR
LVLDQGYWPE TLLAAPSDDA LRGDVELAKA MGFNGVRKHQ KIEDPRYLYW ADRLGLMVWE
EMPSAYRFTR SAIKRMVREW TEAIERDYSH PCVIVWVPFN ESWGVPELTA VRKQRHAVEA
LYHLTKTLDA TRPVIGNDGW ESSATDIIGI HDYDANIEHL RQRYGAEIKP EQLFDRRRPG
GRILTLDGYP HRGQPIMLSE FGGIAFAKCP QPGVEHTWGY TVAHAEEEFA RMYAELMHTV
IHTALFSGFC YTQFADTFQE ANGLLCADRT PKIPIEQIAR VTRISPTYIP GGV
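
The gene and protein sequences above are printed in blocks of ten characters, and the gene begins with TTG rather than ATG. A minimal sketch of how such a record could be processed is given below: it strips the block spacing, computes GC content, and translates the coding sequence. It assumes the amino-acid assignments of the standard genetic code (which translation table 11 shares) and treats a table 11 start codon in the first position as methionine. The helper names `clean`, `gc_content`, and `translate_cds` are hypothetical and not part of any database tool.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same assignments as the standard code; it differs in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11; in first position they encode Met.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # alternative start codons are read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "TTGAAGTTCG CGCACCCTCG ACCGCAGATG CAGCGCTCCA ATTGGACCTC GCTCAACGGA"
print(translate_cds(first_row))  # MKFAHPRPQMQRSNWTSLNG
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field of the record.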