Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4107 |
Symbol | |
ID | 8139481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4689534 |
End bp | 4691321 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871722 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003023880 |
Protein GI | 253702691 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 103 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTATC CCCGCCCGCA ATTGCAACGC CCCAACTGGC ATACGCTCGA CGGCCAGTGG CACTTCAGCT TCGACGACGA TCTCTCTTAC AGCATTCCGG CCGACGTGAA AGAGTGGCCC CTCACCATTG AGGTACCGTA CGCGCCCGAG AGTGAGGCAA GCGGCATCGG CGACACCTCC TTTCACCATG CCTGCTGGTA CCAGCGCGAG TTCTTCTACG ACGCGAGCGA CGGGAAGAAG GTGCAGCTCC ATTTCGGCGC CGTCGATTAC CTGGCGCACG TCTGGGTCAA CGACATGCTG GTGGCGCGGC ACCGAGGCGG CTTCACCCCG TTCAGCGCCG ACATCACCCA CGCCCTCGAT CCTTCCGGGC GCCAGGTCGT GACCGTGCGG GTCGAGGACG ACCCTGAGGA TCTGGAAAAG CCGCGCGGAA AGCAGGACTG GAAGCTGGAG CCGCACGCGA TCTGGTACTA CCGCACCACC GGCATCTGGC AGACCGTCTG GCTGGAATCG GTCCCCGAGA CCTACCTCAA GAAGCTGCGC TGGTCGCCGC ACCTGGAGCG CTGGGAGGTG GGATTCGAGG CCTTCATCGT GGGGCCTATC GCGGACCAGA TGGAAGTGAA CGTGCGGCTT TCCTTCAGCG ACCAGCTCCT GGTCGAGGAC CGCTACCGGG TGATCCACGG CGAGGTGCAC CGGCGCATCG CCCTCTCCGA CCCGGGGATC GACGACTTCA GGAACGACCT CCTCTGGTCC CCGGAGCACC CGAGGCTGAT CCACGCCGAC ATCAAGCTGA TGCAGGGGGG CGAGGTGATC GACGAGATCA CCTCCTACAC CGCGCTCCGT TCCGCCAAGG TGCACCGGGA CCGGTTCCTC CTGAACGGGC GGCTCTATCC GCTCAGGCTG GTGCTGGACC AGGGTTACTG GCCGGAAACG CTGATGACTC CCCCCTCCGA CGAGGCGCTC AAGCGGGACG TCGAGCTCAC CAAGGCGATG GGGTTCAACG GGGTGAGAAA GCACCAAAAG CTGGAGGACC CGCGCTACCT GTACTGGGCG GATCGGCTGG GGCTCGTGGT CTGGTCGGAG ATGCCGAGCG CCTTCCGCTT CACCACGCGC GCGATAAAAA GGCTGATGCG GGAGTGGATC GAGGCGATCG ATCGTGATTA CAGCCACCCC TGCGTCATCG TCTGGGTCCC CTTCAACGAG TCGTGGGGGG TCCCTGACTT GACGGCCACC AAGGCGCACC GGGACGCGGT GCACGCCTTC TACCACCTCA CCAAGACGCT CGACCCCGAG CGGCCGGTGA TCGGCAACGA CGGCTGGGAA AGCTCCGCCA CCGACATCAT CGGCATCCAC GACTACGACA ACAACCCCGA GAGGCTCGCG GAGCGCTACG GCCCGCAGGT GAAGCCGGAG GAGCTCTTCG ACCGGCGCCG CCCGGGCGGG CGCATCCTCA CCCTCGACGG CTATCCGCAC CGGGGGCAGC CCATCATCCT CACCGAGTTC GGCGGCGTCG CCTACGTAAA ACCCTCGGAC GCCCTGCACC AGAAGGCGTG GGGGTATTTC CGCCACGACA AGATCGAGGA GTTCGAACGC CTCGCCCTCT CCCTCATCGA GACCGCCCGC GGCGTCGCCA TGTTCAGCGG CTTTTGCTAC ACGCAATTCG CCGACACCTT CCAGGAGGCC AACGGCCTCC TCTTCGCGGA CCGGACCCCG AAGATCCCGC TGGAGCGGAT CGCCGACGCG GTGCGTGGGA AGGTCGAACA GGGCGCGCTC TTCTGGGAAA GCACCTGA
|
Protein sequence | MDYPRPQLQR PNWHTLDGQW HFSFDDDLSY SIPADVKEWP LTIEVPYAPE SEASGIGDTS FHHACWYQRE FFYDASDGKK VQLHFGAVDY LAHVWVNDML VARHRGGFTP FSADITHALD PSGRQVVTVR VEDDPEDLEK PRGKQDWKLE PHAIWYYRTT GIWQTVWLES VPETYLKKLR WSPHLERWEV GFEAFIVGPI ADQMEVNVRL SFSDQLLVED RYRVIHGEVH RRIALSDPGI DDFRNDLLWS PEHPRLIHAD IKLMQGGEVI DEITSYTALR SAKVHRDRFL LNGRLYPLRL VLDQGYWPET LMTPPSDEAL KRDVELTKAM GFNGVRKHQK LEDPRYLYWA DRLGLVVWSE MPSAFRFTTR AIKRLMREWI EAIDRDYSHP CVIVWVPFNE SWGVPDLTAT KAHRDAVHAF YHLTKTLDPE RPVIGNDGWE SSATDIIGIH DYDNNPERLA ERYGPQVKPE ELFDRRRPGG RILTLDGYPH RGQPIILTEF GGVAYVKPSD ALHQKAWGYF RHDKIEEFER LALSLIETAR GVAMFSGFCY TQFADTFQEA NGLLFADRTP KIPLERIADA VRGKVEQGAL FWEST
|
| |