Gene TM1040_3309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3309 
Symbol 
ID4075713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp317847 
End bp319181 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content60% 
IMG OID638004817 
ProductBeta-glucosidase 
Protein accessionYP_611543 
Protein GI99078285 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.581632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.566627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATT TCACTTTTAC ACGCCGCGAC TTTCCAGGGG ATTTCCTGTT CGGCTGCGCG 
ACGTCTTCCT ATCAGATTGA AGGCCATCAA TATGGCGGCG CAGGGCCGAC CCATTGGGAT
AGTTTTGCCG CAACACCCGG CAATGTGGTG CGTTCTGAGG ATGGCGCACG CGCCTGTGAC
CACTACCACC GCTTTGAGGA GGACCTCGAT CTCGCCGCTG CAGCGGGGTT TGAGTGCTAT
CGGTTCTCGA CCAGTTGGGC ACGCGTGCTG CCCGAGGGGC GTGGCACGCC CAATGCCGAG
GGGCTCGATT TCTACGACCG GCTGACCGAC GCGATGCTCG AACGCGGCCT GAAGCCCTGC
GCGACGCTCT ACCACTGGGA GCTGCCGCAG CCACTGGCCG ACATGGGCGG CTGGCGCAAT
CGAGATGTAA GCAACTGGTT TGCCGAATTC ACCGAAGTCA TCATGTCCCG GATCGGCGAT
CGGATGTATT CCGTGGCACC CATCAACGAA CCCTGGTGCG TTGGGTGGCT TTCGCATTTT
CTGGGCCATC ACGCGCCCGG TCTGCGCGAC ATCCGCGCCA CCGCACGGGC CATGCACCAT
GTCCTGTTGT CCCATGGCCG CGCCATCGAA GTGATGCGGG GCCTTGGAAT GAACAACCTT
GGTGCGGTGT TCAATTTTGA ATGGGCGGAG CCGCTGGATC AGAGTGCCCA AGCTCAGGCC
GCGGCAGAAA CCTATGATGC GATCTACAAC CGCTTTTTCC TCGGCGGGGT CTTTAAAGGC
GCTTATCCCG AAGCAGCCTT GCGCGGGCTC GAACCCCATT TGCCGCAGGG CTGGCAGGAC
GATTTCGCCA CCATCACCCA AAAGGTCGAT TGGTGCGGTT TGAACTATTA CACTCGCAAG
GTGATCGGTC CAGACAATGG CCCCTGGCCC CATTACGCAG AGCTGGTGGG CGAACTGCCC
ACGACCCAGA TGGGGTGGGA GATTTATCCA GATGGGCTTT ACAAGTTTCT GAAGCGCACA
GCCGAAGACT ACACCGGAGG TCTGCCGCTC ATCGTGACCG AGAACGGCAT GGCAAACCCG
GATGTGCTTC TGGAGGGCGA GGTGCCGGAC GCAGCCCGCA TCGCCTACGT TGAGGCACAC
CTTGCGCGGG TACGCCAAGC GATTGCAGAG GGCGTCCCGG TGAAGGGTTA CTTTCTGTGG
TCACTTCTCG ACAATTACGA ATGGGCGCTC GGGTACGAGA AACGCTTTGG TCTGGTGCAT
GTCGACTTTG AGACGCTGAA GCGCACGCCG AAGGCCTCTT ATCGTGCCTT GCAACGTGCG
CTTACCGCAT CCTGA
 
Protein sequence
MTDFTFTRRD FPGDFLFGCA TSSYQIEGHQ YGGAGPTHWD SFAATPGNVV RSEDGARACD 
HYHRFEEDLD LAAAAGFECY RFSTSWARVL PEGRGTPNAE GLDFYDRLTD AMLERGLKPC
ATLYHWELPQ PLADMGGWRN RDVSNWFAEF TEVIMSRIGD RMYSVAPINE PWCVGWLSHF
LGHHAPGLRD IRATARAMHH VLLSHGRAIE VMRGLGMNNL GAVFNFEWAE PLDQSAQAQA
AAETYDAIYN RFFLGGVFKG AYPEAALRGL EPHLPQGWQD DFATITQKVD WCGLNYYTRK
VIGPDNGPWP HYAELVGELP TTQMGWEIYP DGLYKFLKRT AEDYTGGLPL IVTENGMANP
DVLLEGEVPD AARIAYVEAH LARVRQAIAE GVPVKGYFLW SLLDNYEWAL GYEKRFGLVH
VDFETLKRTP KASYRALQRA LTAS