Gene Lferr_1739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1739 
Symbol 
ID6877721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1715091 
End bp1716947 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content70% 
IMG OID642789608 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_002220168 
Protein GI198283847 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.158802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000847686 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGCCGCC ACGACATGCC TTTCGGCGCC CAGCGCCTCG AAGACGGGCG CTGGCGGTTC 
CGGCTCTGGG CACCTGCGGC GGGAACGGTG GAACTCCAGC TCCCGCACGG CACAGGGGTA
CGGTACACGC CTATGCGGCC GGAGACCGAG GGTTGGTTCG CGCTGGAAAC GGGGGCGGCT
CCGGGGACCG CTTATGCCTA TCGCATCAAT AGGGACCTCG TCGTCCCGGA CCCCGCATCC
CGCGCCCAGC TCGACGATGT TCACGGCCCG AGCCTGCTGG TGGACCCCGC TGCCTTTGCC
TGGGAAGACG GCCATTGGCG GGGGCGGCCC TGGGAAGAGG CCGTGATCTA CGAGCTGCAT
ACCGGCACCT TCTCGTCCGA GGGCACTTTC GGCGGGATCA CCGCGCGCCT GGACCAGCTG
GCCGGGCTGG GCGTCACGGC CCTGGAGCTC ATGCCCGTGG CGGACTTTCC CGGCGAGCGG
GACTGGGGCT ACAACGGGGC GCTGCCCTTC GCGCCGGATC GGCGCTACGG CACGCCCGAC
GACCTGAAAC GCCTGGTGCA GGGAGCTCAC CGGCGTGGCC TCATGGTATT CCTCGACGTC
GTGTACAACC ATTTCGGACC CGTGGGAAAT TATCTGTCCC ATTACGCCCC CGCCTTCTTC
ACCGACCGCC ACCACACCCC CTGGGGGGCG GCCATCAATT TCGATGGCCC CGGAAGCCGC
ACGGTGCGCG CGTTTTTCAT CCACAACGCC CTGTACTGGC TGGAAGAGTT TCACATGGAC
GGCCTGCGCC TCGACGCCGT GCATGCCATC TGCGACGACT CGGACCCGGA CATCATCGAG
GAGCTGGGCG CGGCGGTCGC CCGGCGCTTT CCGCACCGCC CCATACACCT GATGCTGGAG
GATGACCGCA ACAGCGCCCA CTACCTGGTC CCGGAGGGTT CACGGCGCAT ATATGCCGCC
CAATGGAACG ATGATTTCCA CCACGCCCTC CACGTCATCC TGACCGGTGA GGCGGAGGGC
TACTACGGGG ACTTCGCCGA CGATCCCCAT GGGCTCCTCG CCCGCTGCCT GGCCGAAGGC
TTCGCCTTCC AGGGTGCCTG GTCCGTCTAC CACGGGCGGC GCCGCGGCGA GCCGAGTGCC
GGTCTCCCGG TCACCGCCTT CATCGGCTTT CTCCAGAACC ACGACCAGGT GGGCAACCGG
GCCTTCGGCG ATCGTCTGGC GACCCTCGCA ACGCCGGAGG CCGTGCGGGC GGCGACGGTG
CTGCTGTTGC TGGCGCCGGC CCCGCCATTG CTCTTCATGG GCCAGGAGTG GGGCAGCCGG
CGCCCATTCC CGTTCTTCTG CGACCTGGGA CCCGACCTCG CGCCGCAGGT GCGGGAAGGC
CGGCTGCGCG AGTTCGCACG CTTCCCGGAC TTCGCCGACG CCGCTGGCCG CGCCCGCATC
CCCGATCCCA GCGACCCGGC GACCTTCCGG TCCGCCCGGC TCGACTGGCG CGAAAGGGAG
CAGGAAACGC ACCGAGCCTG GCTGGCCCTG CATCGCGAGC TGCTCGCCCT TCGCCAGCGC
GAGCTGGTGC CGCGTCTGGC CGGCGTCACC GTGGAGGGGG CGGTGGCGGC CCATTTCGGC
AAAGGCGGCG TGACGGCACG CTGGCGCCTC GCCGATAGGA TCGTCCTGGT GGTCGTCGCC
AACCTGGACG CGGAGCCGGC CGCCCACCTT CGCCTTCCCA CCGGCAAGCA GCTTTACGCC
ACACCAGCGG CATCCGGACC CGACGGCTCC TTCGCACCCT GGGCGGTGGC TTTCTTCCTG
TGTGCCCCCC ATGACGCCGA GGTCGCCGTG GACGGAACAT CGCCGTTCGG CCACTGA
 
Protein sequence
MCRHDMPFGA QRLEDGRWRF RLWAPAAGTV ELQLPHGTGV RYTPMRPETE GWFALETGAA 
PGTAYAYRIN RDLVVPDPAS RAQLDDVHGP SLLVDPAAFA WEDGHWRGRP WEEAVIYELH
TGTFSSEGTF GGITARLDQL AGLGVTALEL MPVADFPGER DWGYNGALPF APDRRYGTPD
DLKRLVQGAH RRGLMVFLDV VYNHFGPVGN YLSHYAPAFF TDRHHTPWGA AINFDGPGSR
TVRAFFIHNA LYWLEEFHMD GLRLDAVHAI CDDSDPDIIE ELGAAVARRF PHRPIHLMLE
DDRNSAHYLV PEGSRRIYAA QWNDDFHHAL HVILTGEAEG YYGDFADDPH GLLARCLAEG
FAFQGAWSVY HGRRRGEPSA GLPVTAFIGF LQNHDQVGNR AFGDRLATLA TPEAVRAATV
LLLLAPAPPL LFMGQEWGSR RPFPFFCDLG PDLAPQVREG RLREFARFPD FADAAGRARI
PDPSDPATFR SARLDWRERE QETHRAWLAL HRELLALRQR ELVPRLAGVT VEGAVAAHFG
KGGVTARWRL ADRIVLVVVA NLDAEPAAHL RLPTGKQLYA TPAASGPDGS FAPWAVAFFL
CAPHDAEVAV DGTSPFGH