Gene Bind_2834 details

Gene Information

Locus tag: Bind_2834
Symbol:
ID: 6198993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 3229588
End bp: 3231429
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 60%
IMG OID: 641706781
Product: thiamine pyrophosphate protein central region
Protein accession: YP_001833892
Protein GI: 182679746
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3962] Acetolactate synthase
TIGRFAM ID:
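
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3229588, 3231429)
print(length)                  # 1842
print(protein_length(length))  # 613
```

Both values agree with the Gene Length and Protein Length fields listed above.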


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGATGG TCCGCTGCAC CATGGCGCAG GCCCTGGTTC GTTATCTTTG CAATCAATTC 
ACCATCGTTA ACGGCCAGCG TGTGCCGCTC TTTCCCGGTG TCTTCGCAAT CTTTGGCCAC
GGCAATGTCA CTTGCCTCGC GGAAGCGCTG GAAGCTGTTC AGGATAAGCT GCCGACCTGG
CGCGGCCAGA ATGAGCAATC CATGGCGCTG GCAGCGATCG GTTTTGCCAA GGCGGTCCGG
CGCCGGCAGA TCATGGTCGC GACCAGTTCC ATTGGCCCCG GCGCTCTCAA CATGGTGACG
GCCGCCGGTG TCGCCCACAC GAACCGCCTG CCGGTGCTGT TGCTCGCTGG CGACACTTTC
GTCAATCGCC GGCCAGATCC AGTCATGCAG CAGGTCGAGC ATTTCGGTAA TCCGACCATC
ACCGTCAATG ATGCTTTCAA GGCGGTCACC CGTTACTGGG ACCGTATCGT CCACCCCGAG
CAGGTCATCT CATCCTTGCC GCAGGCGGTC GCTGCCATGC TCGATCCGGC CGATTGCGGT
CCGGCTTTCA TCGCCCTCCC CCAGGACGTG CAGGAAATGG CTTGGGACTA TCCCGAAGCC
TTTTTCGCGG AAACGGTGCA TAACATTCCA CGGCCGCGCC CGGATCGTGG GCGCCTCGAT
GAAGCAGCCT CTCTTCTCAA GAATGCACAG CGGCCGCTGA TCATTTCAGG AGGCGGTGTA
CGTTATTCCG GCGCCGAAGA TGCCCTCGCC GCTTTCGCCG CGAAACACGG CGTTCCGCTG
TGTGAAACCA TTGCCGGCAA GGGGAGTGTT TCGCATGACC ATCCCGTTCA TGTCGGGCCG
ATCGGCATTG TCGGCTCGAC ATCGGCCAAT GCCATGGCCG CGGAAGCGGA TGTGATTCTC
GCTGTCGGCA CGCGGCTCAT GGATTTCACG ACAGGCTCCT GGTCGTCTTT CCGGCAGGAC
GCCAAATTCA TCACTGTCAA TACGGCGCGC TGGGATGCGA CCAAACATCG GGCGCTTGCT
GTAGTCGGCG ATGCGCTCGA AACAGTAAAG GAACTCGATC AAAACCTTGA TGGCTGGACG
GCCGATACCG CCTGGACCGA ACAGGGAAAG ATCGAATTCG CGAAATGGAA TGTGGCGCTC
GACGGTTTCC AAAAGCCGAC CAACGATCCA ATTCCCACTT ATGCGCAGGT CATCGGCGTG
GTGAACGCCA AGGCTGGCGA ATATGATCTC CTGATCACGG CGGCGGGCGG CCTACCCGGC
GAAGTCATGA AGAACTGGCG TGTGAAGGCG CCGAATACAT TCGATTGCGA ATTCGGCTTT
TCCTGCATGG GCTATGAAAT CCCTGCTGGC TGGGGCGCCG CCATGGCCGA TCCCACACGC
ACACCCATCG TCATGATCGG CGATGGCACA TACATGATGA TGAATTCGGA TATCTATTCC
TCGGTTCTGT CAGGGCACAA GATCATTCTC ATCGTCTGCG ACAACGGCGG TTATGCCGTC
ATCAATCGTC TGCAAAACGC CAAGGGCGGT GCCTCCTTCA ACAATCTCCT CAAGGATTGT
CGGGTGAAGG AGCCCTTCGC GGTGGACTTC AACAAACATG CAGAAGCGAT GGGTGCTCTG
ACGCGGCGGG TGGAAAGTCT CGCCGATCTC GGCCAGGCCG TGGAATGGGC GAAGACCACC
GACCGCACCA CCGTCATTAC CATCGTTTCC GACGCCTTCA CCTGGACCCC GGGCGACGCC
TGGTGGGATG TGGGCGTGCC GCAAGTGAGT GTCCGCGCGG AAGTCAGTAA TGCCGCGCAA
CAGCAGCAGG AAGGACGGAC CCGCCAGCGC GTCGGCGTCT GA
 
Protein sequence
MAMVRCTMAQ ALVRYLCNQF TIVNGQRVPL FPGVFAIFGH GNVTCLAEAL EAVQDKLPTW 
RGQNEQSMAL AAIGFAKAVR RRQIMVATSS IGPGALNMVT AAGVAHTNRL PVLLLAGDTF
VNRRPDPVMQ QVEHFGNPTI TVNDAFKAVT RYWDRIVHPE QVISSLPQAV AAMLDPADCG
PAFIALPQDV QEMAWDYPEA FFAETVHNIP RPRPDRGRLD EAASLLKNAQ RPLIISGGGV
RYSGAEDALA AFAAKHGVPL CETIAGKGSV SHDHPVHVGP IGIVGSTSAN AMAAEADVIL
AVGTRLMDFT TGSWSSFRQD AKFITVNTAR WDATKHRALA VVGDALETVK ELDQNLDGWT
ADTAWTEQGK IEFAKWNVAL DGFQKPTNDP IPTYAQVIGV VNAKAGEYDL LITAAGGLPG
EVMKNWRVKA PNTFDCEFGF SCMGYEIPAG WGAAMADPTR TPIVMIGDGT YMMMNSDIYS
SVLSGHKIIL IVCDNGGYAV INRLQNAKGG ASFNNLLKDC RVKEPFAVDF NKHAEAMGAL
TRRVESLADL GQAVEWAKTT DRTTVITIVS DAFTWTPGDA WWDVGVPQVS VRAEVSNAAQ
QQQEGRTRQR VGV
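
Both sequences are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how that might be done, along with a GC percentage calculation; `unblock` and `gc_percent` are hypothetical helper names, and only the first and last lines of the gene sequence are used as a short demonstration.

```python
def unblock(seq_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCGATGG TCCGCTGCAC CATGGCGCAG GCCCTGGTTC GTTATCTTTG CAATCAATTC"
last_line = "CAGCAGCAGG AAGGACGGAC CCGCCAGCGC GTCGGCGTCT GA"

print(len(unblock(first_line)))                # 60
print(unblock(first_line).startswith("ATG"))   # True: ATG start codon
print(unblock(last_line).endswith("TGA"))      # True: TGA stop codon
```

Passing the full unblocked gene sequence to `gc_percent` would give a value to compare against the GC content field of the record.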