Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2309 |
Symbol | |
ID | 3786714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2626346 |
End bp | 2628229 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637812396 |
Product | glycoside hydrolase 15-like protein |
Protein accession | YP_412992 |
Protein GI | 82703426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.451271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAC CTATCGAAGA ATACGGATTC ATCGGTAACA TGCTGAGCGG AGCGCTCGTC GCCCGTGATG GCTCGATGGA TTGGCTTTGC CTTCCACGGT TCGATTCCGA TGCATGCTTC GCCGCACTGC TGGGGGCATC CAAGCATGGT TATTGGCAGA TTTCCCCGGC TGGCGAGAGT AACCGCTCTA GTCGGCGTTA TCTGCCCCAT GCACCGGTCC TCGAAACCAC CTTCGAGACA GAGGGTGGCA TCGTGACGCT CGTGGATTTC ATGCCGCTTT CAAATGACCC CGAGAGAGTC GATGTCGTTC GCCTTGTACA GGGGGTGTCG GGCAAGGTGC GGATGCGGAT GGAGTTGGCC CTGCGCTTCG GCTACGGCAA GACCATTCCC TGGGTACGCC GCCGCGACTA CGGCATCCAC GCCGTTGCCG GCCCGGATGC GGTGGAACTG GCAACTCCTG TCACCCTGCG CGGCGAGGAC ATGCGCACCG TTGCCGAGTT CGAGGTGGGA GAAGGCGACG TTATCCCCTT CACGCTCGCC TATCATCCCC TACATCGAGA GCCTCACTTC ATTGACGACG GCAGGAAGAG GCTCGAGCAC ACACTTGCCT GGTGGCGGGA ATGGACCCGT ATCTGCCAGC TCTCGGAACT GGAGGAACCC GGATGGAACG ATGCAGTCGA GCGTTCGCTC ATTACGCTCA AGGCACTCTC CTACCAACCA AGCGGGGGCA TCGTCGCGGC GCTTACGACC TCCCTCCCCG AAGAGCTCGG CGGAGTCCGA AACTGGGATT ACCGCTACTG CTGGATCCGC GACGCCACAC TCACGTTATA TGCATTCATG AATGCAGGAT GCTTCGACGA GGCCGGGGCA TTCAGGGAAT GGATGCTTCG AGCGGCTGCG GGCGCCCCTG ATCAGATGCA AATCATGTAT GGGATCGAGG GTGAACGCCG CCTGACTGAA ATCGAACTGC CTTGGCTGCC GGGTTACGAG AACAGCCTCC CCGTGCGCAT CGGCAACGGT GCGCATGAGC AGATTCAGGT AGACGTGTTT GGCGAACTGA TGGACACACT TTATACCGCC CGCAAGTCGC AACTCGGGCC GCATCAGGAA GCCTGGCGGT TTCAGCAAGC GATTCTTTCC CGACTGGAGA GTCTGTGGCG TGGGCCAGAC CAAGGCATCT GGGAGGTGCG CGGTAGCCCC AAGCACTTCG TCTATTCGAA AATGATGGCT TGGGTCGCGT TTGATCGGGC CATAAAAGCT GTGGAGCAAT TCGGCTTCCC TGGTCCTGTC GGGAAATGGC GCACGCTCCG CGATGAAATT CATCGGGAAG TGCTGGCGCG CGGCTATGAT AAGGAACGGA ACACATTCGT GCAGCATTAC GACGGCGTGG GACTGGATGC CTCCCTGCTG CTGATGGCCG AGGTAGGATT TCTCCCACCG GACGATCCCC GCTTTCGGGG AACGGTAGAA GCCATTGAAC GCGACCTGAT GGAAGATGGA CTTGTGCTGC GTTACCGCGT TGGCGAAACC AAAGACGGGC TCGCCGGCGA GGAAGGAACT TTTCTCGTTT GCAGCTTCTG GCTTGCTGAT GCCTATACAA TGATCGACCG CGGTCACGAT GCGGCAGTTC TTTTCGAGCG CCTCCTGTCT CTACGCAACG ACCTCGGGCT TCTTGCCGAG GAATACCACC CCCGCCACCG GCGGCAACTG GGAAACTTCC CCCAAGCGTT TTCCCACGTG GGTTTGATCA ATACGGCATA CAACCTTCGC CGCATCAACG GCCCCGCCCA GCAGCGCGCT GATCGCAGCG CGTCGCCTCA TGCCACTCAC ACCTCCTGGA CCGGAACCGC GGGCGAGCAG CACCGCGAAC GATCGATAGA CTAA
|
Protein sequence | MSKPIEEYGF IGNMLSGALV ARDGSMDWLC LPRFDSDACF AALLGASKHG YWQISPAGES NRSSRRYLPH APVLETTFET EGGIVTLVDF MPLSNDPERV DVVRLVQGVS GKVRMRMELA LRFGYGKTIP WVRRRDYGIH AVAGPDAVEL ATPVTLRGED MRTVAEFEVG EGDVIPFTLA YHPLHREPHF IDDGRKRLEH TLAWWREWTR ICQLSELEEP GWNDAVERSL ITLKALSYQP SGGIVAALTT SLPEELGGVR NWDYRYCWIR DATLTLYAFM NAGCFDEAGA FREWMLRAAA GAPDQMQIMY GIEGERRLTE IELPWLPGYE NSLPVRIGNG AHEQIQVDVF GELMDTLYTA RKSQLGPHQE AWRFQQAILS RLESLWRGPD QGIWEVRGSP KHFVYSKMMA WVAFDRAIKA VEQFGFPGPV GKWRTLRDEI HREVLARGYD KERNTFVQHY DGVGLDASLL LMAEVGFLPP DDPRFRGTVE AIERDLMEDG LVLRYRVGET KDGLAGEEGT FLVCSFWLAD AYTMIDRGHD AAVLFERLLS LRNDLGLLAE EYHPRHRRQL GNFPQAFSHV GLINTAYNLR RINGPAQQRA DRSASPHATH TSWTGTAGEQ HRERSID
|
| |