Gene Bind_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2204 
Symbol 
ID6199758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2524033 
End bp2526420 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content58% 
IMG OID641706194 
Productputative phosphoketolase 
Protein accessionYP_001833312 
Protein GI182679166 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3957] Phosphoketolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.221992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGATC GCGCGCACGG TCCGGCCGGA ATCAGTCAGC CTCTCTCGCC GGATCTGTTG 
CAGCGTCTCA ATGCCTGGTG GCGCGCCGCC AATTATCTCT CGGTGGCGCA GCTCTACCTT
CTCGACAATC CGCTGTTGCG GCAAAAGCTC ACCCTGGATC ATATCAAGCC GCGCCTGCTC
GGCCATTGGG GCACGACACC GGGGCTCAAT TTCATCTATG TGCATCTCAA CAGGATCATC
AAGGAACGCG ATCTCGATAT TCTGTTCATT GCTGGCCCCG GGCATGGCGC GCCGGGGCTC
ATTGCCAATA GCTGGCTGGA AAAGACCTAT AGCGAGGTCT ATCCCGCTGT TTCGCAAGAT
ATCGAAGGCA TGACCCGGCT ATGCCGCCAA TTTTCATTCC CTGTCGGCAT CCCGAGCCAT
GCGGCGCCGG AAACACCCGG CTCCATCCAT GAAGGCGGCG AGCTTGGCTA TTCGCTCTCC
CATGCCTTTG GCGCGGTCTT TGACAACCCC GACCTGATCG CGGCCTGCGT GATCGGAGAC
GGCGAGGCGG AAACCGGCCC ACTCGCCACG TCATGGCATG CCAATAAATT TCTCGATGCC
GCGCGTGATG GCGCCGTGCT GCCGATCCTT CATCTCAACG GCTATAAAAT CGCCAATCCC
ACCGTGCTCG GGCGCATCCC GTCCGATGAA TTGGAAAATC TGCTGCAAGG CTATGGCTAT
GCGCCCCTTT TCGTCGAAGG CGACGACCCG GACATTATGC ATCGGCACAT GGCCGAAGCA
CTAAATGTCG CTTTTGCGGG TATTGCCAAA ATCCAGCGAG CCGCACGTGT CGAGGGCCAT
ATCGAGCGGC CACGCTGGCC CATGCTGGTG CTCCGCAGCC CCAAGGGCTG GACCGGCCCC
AAAACGGTTG ACGGGTTGAA GACGGAAGGA TTTTGGCGTG CCCATCAGGT CCCATTCACC
ATCGCCGACA AGCCTGAACA TCTCTCGCTT CTCGAAAGCT GGCTGAAAAG CTACCGACCG
GAGGAGCTTT TCGAGGAAAG CGGCGCCCTC AAGCCGGAGA TTGCTTCCCT GGCCCCCAGC
GGGGAGCGCC GAATGAGCGC CAATCCGCAG GCCAATGGCG GCAACTTGCC ACGGCCGTTG
CAAATACCGG ATTTTACCCT GCATGCCGTC GATGTCGATT ACCCCGGCGC ACGAACGGCA
GAGGCGACTT TCGTTATGGG GCAATTTCTG CGCGACATCA TGCGCGCCAA TGAGACAAAC
AAGAATTTCC GCGTCTTCGG GCCGGATGAA ACCGCCTCCA ACCGTTTGCA GGCGCTCTAC
GACGTCACGG ACAAGACCTG GAATGCCGCC TTCATCGCGG AGGATGAACA TCTCGACCCG
ACGGGCCGCG TCATGGAAAT CCTCAGCGAA CATACATGCC AAGGCTGGCT TGAAGGCTAT
CTGCTCACGG GGCGCCATGG CCTCATGTCT TGTTATGAGG CCTTCATTCA TATTGTTGAT
TCCATGGTCA ATCAGCACGC CAAATGGCTC AAGACGGCGC GTGGTGTCCC ATGGCGTCGG
CCCATCGCCT CGCTCAATTA TCTTCTAACG TCTCATGTCT GGCGCCAGGA TCATAATGGC
TTTAGCCATC AGGATCCCGG TTTCATCGAT CATATCGCCA ACAAGAAATC CGATATCGCG
CGCGTTTATC TACCGCCGGA TGCCAATTGC CTTCTCTATA TCACCGATCA TTGTCTGCGC
AGCTGGAACA GGATCAATGT CATTGTCGCC GGCAAACAGC CGGAACCGCA ATGGCTCGAC
ATGGAGGAAG CGATCAGCCA TTGCCGCGCC GGTCTCGGCA TCTGGAGTTT CGCTAGCAAT
GATCAAGATA GCGAGCCCGA TGTGGTCCTC GCCTGCGCCG GCGACGTGCC AACGCTCGAA
ACCCTCGCCG CAGCCGATTT CTTGCATACA CATCTGCCGG AGGTGAAAGT CCGTGTCGTC
AATATCGTTG ATCTGTTTGC GCTAGAGCCG CAAACACGCC ATCCGCATGG CTGGAGCGAT
AAGGATTTTG ATACGCTGTT CACCAAGGAC AAACCGGTGA TCTTCGCCTA TCATGGTTAT
CCTTCCCTCA TCCATCGGCT GATCTACAAG CGGACCAACC ATTCCAATTT CCACGTCCAC
GGCTATCAGG AAGAGGGTTC AACGACGACA CCCTTCGATA TGGTGGTGCG CAACCGGCTC
GATCGTTTTC ATCTCGTCGG CGACGTTGTC GATCATCTAC CGAAGCTCGG CGCCAAGGCG
GCCTATGTCA AACAATGGCT GCGTGACAAG TTCGCCGAGC ACGAACGCTA TATCGTGGCG
CATGGCGAGG ATTTGCCGGA GATCCGTCAT TGGCGCTGGC CAGAATGA
 
Protein sequence
MDDRAHGPAG ISQPLSPDLL QRLNAWWRAA NYLSVAQLYL LDNPLLRQKL TLDHIKPRLL 
GHWGTTPGLN FIYVHLNRII KERDLDILFI AGPGHGAPGL IANSWLEKTY SEVYPAVSQD
IEGMTRLCRQ FSFPVGIPSH AAPETPGSIH EGGELGYSLS HAFGAVFDNP DLIAACVIGD
GEAETGPLAT SWHANKFLDA ARDGAVLPIL HLNGYKIANP TVLGRIPSDE LENLLQGYGY
APLFVEGDDP DIMHRHMAEA LNVAFAGIAK IQRAARVEGH IERPRWPMLV LRSPKGWTGP
KTVDGLKTEG FWRAHQVPFT IADKPEHLSL LESWLKSYRP EELFEESGAL KPEIASLAPS
GERRMSANPQ ANGGNLPRPL QIPDFTLHAV DVDYPGARTA EATFVMGQFL RDIMRANETN
KNFRVFGPDE TASNRLQALY DVTDKTWNAA FIAEDEHLDP TGRVMEILSE HTCQGWLEGY
LLTGRHGLMS CYEAFIHIVD SMVNQHAKWL KTARGVPWRR PIASLNYLLT SHVWRQDHNG
FSHQDPGFID HIANKKSDIA RVYLPPDANC LLYITDHCLR SWNRINVIVA GKQPEPQWLD
MEEAISHCRA GLGIWSFASN DQDSEPDVVL ACAGDVPTLE TLAAADFLHT HLPEVKVRVV
NIVDLFALEP QTRHPHGWSD KDFDTLFTKD KPVIFAYHGY PSLIHRLIYK RTNHSNFHVH
GYQEEGSTTT PFDMVVRNRL DRFHLVGDVV DHLPKLGAKA AYVKQWLRDK FAEHERYIVA
HGEDLPEIRH WRWPE