Gene GM21_2539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2539 
Symbol 
ID8137881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2968090 
End bp2969970 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content65% 
IMG OID644870148 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003022338 
Protein GI253701149 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0000000367182 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGCGC TGCAAAGACG GCTACCGATA GGAGCGGAGG TTTTCCCGGA AGGTGTCCAC 
TTCCGTGTTT GGGCCCCGGC GCACCGGAAG GTGGAGGTGG TGCTGGAGGG AGGGGCGGGC
CAGGGGAGTT CCCTGGAGCT GGCATCGGAG GAGGAGGGGT TTTTCGCCGG ACTAAGCCGC
GAGGCGCGGA CGGGGTCGTT GTACCGCTTC CGGCTGGACG AGGCCGAGCA CCTCCACCCG
GACCCCGCCT CGCGCTTCCA GCCCGACGGC CCCCACGGCC CCTCACGGGT GGTCGACCCC
TCCCAATACC GGTGGAGCGA CGGCCAATGG CCCGGGATCG TGCGCGAGGG GCACGTGATC
TACGAGATGC ACCTGGGGAC CTACACCCGG GAAGGGAGTT GGCGCGCCGC GGCGGACCAG
CTCCAGGAAC TGAAGGAGTT GGGGATCACC CTGGTAGAAG TGATGCCGGT GGCGGATTTC
CCCGGCAGGT TCGGCTGGGG GTACGACGGG GTGAACCACT TCGCCCCCGC GAGGCTCTAC
GGCGAGCCGG ACGACATGCG CAGCTTCGTG GATCAAGCCC ACCGCCTTGG GCTTGGGGTG
TTGCTGGACG TGGTCTACAA CCACTTCGGC CCGGAAGGGA ACTACCTGGC CCAGTTCTCC
CAGTACTACT ACGCGGAGGA GGAAGGCGAA TGGGGCAAGG CGATGAACTT CGACGGCAGG
CGCAGCCGCC CGGTAAGGGA GTTCTTCGCC GGCAACGCCG CCTACTGGAT CGAGGAGTTC
CACCTGGACG GCCTGCGCTT CGACGCCACC CACGCCATTC GCGACGACTC CCCCATCCAT
GTCCTGGGCG AGATCACCGA AAAGGCGCGG GCAGCCGCCG GAAAACGCAC CATCTTGTTG
GTGGCCGAGA ACGAGTACCA GGACGTCCGC TGCCTGCGCG ACAAGGAAGC CGGCGGCTTC
GGCATGGACG CGGTCTGGAA CGACGACTTC CACCACAGCG CCTACGTGGC GCTCACCGGC
TATCACGACG CCTACTATTC GGAATACTTC GGCTCCCCCC AGGAGCTGGT CTCGGCTGCC
AAGTGGGGCT ACCTGTACCA GGGGCAGTTC TACTTCTGGC AGGGGAAAAG GCGCGGCTCC
CCAACCATCG GCGTGAGCCC CTCCTGCTTC GTCAACTACA TCCAAAACCA CGACCAGATC
GGCAACTCCG CCTGGGGGAT GAGGATCGAC AGGCTCACCA ACCCGGCGGC CCTCAGGACC
ATGACGGCGC TGCTGATGCT CGCCCCGCAG ACCCCGATGA TCTTCCAGGG GCAGGAGTTT
TCCGCGAGCT CCCCTTTTCT CTATTTCGCG GACCTGAGCC CGGAGATCTC GCAGGCGGTG
CACACCGGCC GCATCGAGTA CCTGAAGCAG TTCACCAACA TAGACGCCCC GGAAATCATC
GACTCCATAG ATAAACCGTA CGAGCTGGAA ACCTTCGAGC AGTCGCGGCT TCGCTTAAGC
GAGCGCGAGC GGAATGCCAA GACCTACGCG CTCTACCGCG ACCTGATCCG GCTGCGGCGC
GAGGACCCGG TCTTCAGCCG CGCCTACGCC TGCCACATAG AGGGGGCGGT GCTGGGGGCG
AGCGCGTTTT TACTCCGCTA TTTCCTGGAG GGTGACCAGC GCCTGCTCCT GATCAATCTG
GGGCGCGAGC TGCATCTGGT GCCCATCCCG GAGCCTATGC TGGCGCCGCC TGAGGGATGC
GACTGGGAGA TCCTCTGGAC CAGCGAGAAG CTGGAATTCG GCGGCTCGGG AACGCCCAAG
CTCGACACGG AGAAGTTCTG GCGCGTACAG GGGAACGCGG CGGTGGTGCT GATTCCGGTC
CGCCAGGGAG AAGAGCCATG A
 
Protein sequence
MAALQRRLPI GAEVFPEGVH FRVWAPAHRK VEVVLEGGAG QGSSLELASE EEGFFAGLSR 
EARTGSLYRF RLDEAEHLHP DPASRFQPDG PHGPSRVVDP SQYRWSDGQW PGIVREGHVI
YEMHLGTYTR EGSWRAAADQ LQELKELGIT LVEVMPVADF PGRFGWGYDG VNHFAPARLY
GEPDDMRSFV DQAHRLGLGV LLDVVYNHFG PEGNYLAQFS QYYYAEEEGE WGKAMNFDGR
RSRPVREFFA GNAAYWIEEF HLDGLRFDAT HAIRDDSPIH VLGEITEKAR AAAGKRTILL
VAENEYQDVR CLRDKEAGGF GMDAVWNDDF HHSAYVALTG YHDAYYSEYF GSPQELVSAA
KWGYLYQGQF YFWQGKRRGS PTIGVSPSCF VNYIQNHDQI GNSAWGMRID RLTNPAALRT
MTALLMLAPQ TPMIFQGQEF SASSPFLYFA DLSPEISQAV HTGRIEYLKQ FTNIDAPEII
DSIDKPYELE TFEQSRLRLS ERERNAKTYA LYRDLIRLRR EDPVFSRAYA CHIEGAVLGA
SAFLLRYFLE GDQRLLLINL GRELHLVPIP EPMLAPPEGC DWEILWTSEK LEFGGSGTPK
LDTEKFWRVQ GNAAVVLIPV RQGEEP