Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2539 |
Symbol | |
ID | 8137881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2968090 |
End bp | 2969970 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870148 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_003022338 |
Protein GI | 253701149 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0000000367182 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGCGC TGCAAAGACG GCTACCGATA GGAGCGGAGG TTTTCCCGGA AGGTGTCCAC TTCCGTGTTT GGGCCCCGGC GCACCGGAAG GTGGAGGTGG TGCTGGAGGG AGGGGCGGGC CAGGGGAGTT CCCTGGAGCT GGCATCGGAG GAGGAGGGGT TTTTCGCCGG ACTAAGCCGC GAGGCGCGGA CGGGGTCGTT GTACCGCTTC CGGCTGGACG AGGCCGAGCA CCTCCACCCG GACCCCGCCT CGCGCTTCCA GCCCGACGGC CCCCACGGCC CCTCACGGGT GGTCGACCCC TCCCAATACC GGTGGAGCGA CGGCCAATGG CCCGGGATCG TGCGCGAGGG GCACGTGATC TACGAGATGC ACCTGGGGAC CTACACCCGG GAAGGGAGTT GGCGCGCCGC GGCGGACCAG CTCCAGGAAC TGAAGGAGTT GGGGATCACC CTGGTAGAAG TGATGCCGGT GGCGGATTTC CCCGGCAGGT TCGGCTGGGG GTACGACGGG GTGAACCACT TCGCCCCCGC GAGGCTCTAC GGCGAGCCGG ACGACATGCG CAGCTTCGTG GATCAAGCCC ACCGCCTTGG GCTTGGGGTG TTGCTGGACG TGGTCTACAA CCACTTCGGC CCGGAAGGGA ACTACCTGGC CCAGTTCTCC CAGTACTACT ACGCGGAGGA GGAAGGCGAA TGGGGCAAGG CGATGAACTT CGACGGCAGG CGCAGCCGCC CGGTAAGGGA GTTCTTCGCC GGCAACGCCG CCTACTGGAT CGAGGAGTTC CACCTGGACG GCCTGCGCTT CGACGCCACC CACGCCATTC GCGACGACTC CCCCATCCAT GTCCTGGGCG AGATCACCGA AAAGGCGCGG GCAGCCGCCG GAAAACGCAC CATCTTGTTG GTGGCCGAGA ACGAGTACCA GGACGTCCGC TGCCTGCGCG ACAAGGAAGC CGGCGGCTTC GGCATGGACG CGGTCTGGAA CGACGACTTC CACCACAGCG CCTACGTGGC GCTCACCGGC TATCACGACG CCTACTATTC GGAATACTTC GGCTCCCCCC AGGAGCTGGT CTCGGCTGCC AAGTGGGGCT ACCTGTACCA GGGGCAGTTC TACTTCTGGC AGGGGAAAAG GCGCGGCTCC CCAACCATCG GCGTGAGCCC CTCCTGCTTC GTCAACTACA TCCAAAACCA CGACCAGATC GGCAACTCCG CCTGGGGGAT GAGGATCGAC AGGCTCACCA ACCCGGCGGC CCTCAGGACC ATGACGGCGC TGCTGATGCT CGCCCCGCAG ACCCCGATGA TCTTCCAGGG GCAGGAGTTT TCCGCGAGCT CCCCTTTTCT CTATTTCGCG GACCTGAGCC CGGAGATCTC GCAGGCGGTG CACACCGGCC GCATCGAGTA CCTGAAGCAG TTCACCAACA TAGACGCCCC GGAAATCATC GACTCCATAG ATAAACCGTA CGAGCTGGAA ACCTTCGAGC AGTCGCGGCT TCGCTTAAGC GAGCGCGAGC GGAATGCCAA GACCTACGCG CTCTACCGCG ACCTGATCCG GCTGCGGCGC GAGGACCCGG TCTTCAGCCG CGCCTACGCC TGCCACATAG AGGGGGCGGT GCTGGGGGCG AGCGCGTTTT TACTCCGCTA TTTCCTGGAG GGTGACCAGC GCCTGCTCCT GATCAATCTG GGGCGCGAGC TGCATCTGGT GCCCATCCCG GAGCCTATGC TGGCGCCGCC TGAGGGATGC GACTGGGAGA TCCTCTGGAC CAGCGAGAAG CTGGAATTCG GCGGCTCGGG AACGCCCAAG CTCGACACGG AGAAGTTCTG GCGCGTACAG GGGAACGCGG CGGTGGTGCT GATTCCGGTC CGCCAGGGAG AAGAGCCATG A
|
Protein sequence | MAALQRRLPI GAEVFPEGVH FRVWAPAHRK VEVVLEGGAG QGSSLELASE EEGFFAGLSR EARTGSLYRF RLDEAEHLHP DPASRFQPDG PHGPSRVVDP SQYRWSDGQW PGIVREGHVI YEMHLGTYTR EGSWRAAADQ LQELKELGIT LVEVMPVADF PGRFGWGYDG VNHFAPARLY GEPDDMRSFV DQAHRLGLGV LLDVVYNHFG PEGNYLAQFS QYYYAEEEGE WGKAMNFDGR RSRPVREFFA GNAAYWIEEF HLDGLRFDAT HAIRDDSPIH VLGEITEKAR AAAGKRTILL VAENEYQDVR CLRDKEAGGF GMDAVWNDDF HHSAYVALTG YHDAYYSEYF GSPQELVSAA KWGYLYQGQF YFWQGKRRGS PTIGVSPSCF VNYIQNHDQI GNSAWGMRID RLTNPAALRT MTALLMLAPQ TPMIFQGQEF SASSPFLYFA DLSPEISQAV HTGRIEYLKQ FTNIDAPEII DSIDKPYELE TFEQSRLRLS ERERNAKTYA LYRDLIRLRR EDPVFSRAYA CHIEGAVLGA SAFLLRYFLE GDQRLLLINL GRELHLVPIP EPMLAPPEGC DWEILWTSEK LEFGGSGTPK LDTEKFWRVQ GNAAVVLIPV RQGEEP
|
| |