Gene TBFG_12496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_12496 
Symbol 
ID5223175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp2788238 
End bp2789878 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content67% 
IMG OID640607255 
Productalpha-glucosidase aglA (maltase) 
Protein accessionYP_001288425 
Protein GI148823671 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones281 
Plasmid unclonability p-value0.447411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones188 
Fosmid unclonability p-value0.123805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAAC ACCAGCGACC GGATCCAATG GGCCCCGGCT CTCCTCGCGC CAGCGCTCGT 
CGACCGGAGC CAGATCCGAT GGGCGAGCCG TGGTGGTCGC GAGCCGTGTT CTACCAGGTC
TATCCCCGAT CGTTCGCCGA CAGCAACGGC GACGGGGTGG GCGACCTGGA CGGGTTGGCG
AGCCGGCTTG ACCACCTGCA ACAGCTCGGT GTCGACGCGA TCTGGATCAA CCCGGTCACC
GTCTCGCCGA TGGCAGACCA CGGATACGAC GTCGCCGATC CCCGCGACAT CGACCCACTC
TTCGGCGGGA TGCCGGCGTT CGAACGGTTG GTCGCTGCGG CACACCGGCA GGGCATCAAA
GTCACCATGG ACGTGGTGCC CAACCACACC AGTTCGGCGC ACCCATGGTT TCAGGCCGCG
CTGGCTGACC TCCCGGGTAG CCCGGCGCGG GATCGCTATT TCTTTCGCGA CGGGCGGGGC
CCCGACGGGT CGCTGCCGCC GAACAACTGG GAGTCGGTGT TCGGCGGGCC GGCCTGGACC
CGAGTGCGCG AACCGGACGG CAACCCGGGC CAGTGGTACC TGCACCTTTT CGACACCGAA
CAGCCGGACC TGAACTGGGA CAACCCGGAA ATCCTTGACG ACTTCGAGAA AACACTGCGC
TTCTGGCTGG ACCGCGGCGT GGATGGCTTC CGCATCGACG TGGCGCACGG CATGGCCAAG
CCCCCGGGCC TGCCGGACTC ACCGGACCTG GGCATCGAGG TGCTGCACCA CCGCGATGAC
GACCCGCGCT TCAACCACCC GAATGTGCAC GCGATTCACC GCGACATCCG CACGGTGATC
GACGAGTACC CCGGAGCGGT AACCGTCGGC GAGGTGTGGG TACACGACAA CGCCCGCTGG
GCGGAGTATC TGCGGCCCGA CGAACTGCAT CTCGGCTTCA ATTTCCGGCT GGCGCGAACC
GAGTTCGACG CCGCCGAGAT CCGCGACGCG GTGGCGAACT CCCTGGCCGC CGCGGCGCTG
CAGAACGCGA CCCCAACCTG GACGCTGGCC AATCACGATG TGGGACGGGA GGTTAGCCGC
TACGGCGGCG GCGAGATCGG GCTGCGCCGG GCCAAGGCGA TGGCGGTGGT GATGCTCGCC
CTGCCGGGCG TGGTCTTCCT CTACAACGGC CAGGAACTGG GTTTGCCCGA CGTGGACCTG
CCCGACGAGG TGCTGCAGGA TCCGACGTGG GAACGCTCGG GACGCACCGA ACGCGGTCGC
GATGGCTGCC GGGTGCCGAT TCCCTGGTCG GGCAACATTC CCCCGTTCGG GTTCTCGACG
TGTCCAGACA CCTGGTTGCC GATGCCGCCG GAATGGGCGG CGCTGACCGC CGAAAAACAA
CGCGCTGATG CCGGCTCGAC CTTGTCGTTT TTTCGACTTG CACTCAGATT ACGTAGGGAA
CGAAATGAAT TCGACGGCGA CGTCGACTGG CTGGCCGCGC CCGACGATGC GCTGATATTC
CGGCGTCACG GCGGGGGTTT GGTGTGCGCG CTCAACGCCG CTGAGCGTCC GCTGGCGCTG
CCGGCAGGTG AACCCATCCT GGCCAGCGCA CCGTTGACCG ACGCCACGTT GCCACCCAAT
GCCGCGGCCT GGCTGGTGTA G
 
Protein sequence
MDQHQRPDPM GPGSPRASAR RPEPDPMGEP WWSRAVFYQV YPRSFADSNG DGVGDLDGLA 
SRLDHLQQLG VDAIWINPVT VSPMADHGYD VADPRDIDPL FGGMPAFERL VAAAHRQGIK
VTMDVVPNHT SSAHPWFQAA LADLPGSPAR DRYFFRDGRG PDGSLPPNNW ESVFGGPAWT
RVREPDGNPG QWYLHLFDTE QPDLNWDNPE ILDDFEKTLR FWLDRGVDGF RIDVAHGMAK
PPGLPDSPDL GIEVLHHRDD DPRFNHPNVH AIHRDIRTVI DEYPGAVTVG EVWVHDNARW
AEYLRPDELH LGFNFRLART EFDAAEIRDA VANSLAAAAL QNATPTWTLA NHDVGREVSR
YGGGEIGLRR AKAMAVVMLA LPGVVFLYNG QELGLPDVDL PDEVLQDPTW ERSGRTERGR
DGCRVPIPWS GNIPPFGFST CPDTWLPMPP EWAALTAEKQ RADAGSTLSF FRLALRLRRE
RNEFDGDVDW LAAPDDALIF RRHGGGLVCA LNAAERPLAL PAGEPILASA PLTDATLPPN
AAAWLV