Gene Bind_1898 details


Gene Information

Locus tag: Bind_1898
Symbol:
ID: 6199576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2163535
End bp: 2165718
Gene Length: 2184 bp
Protein Length: 727 aa
Translation table: 11
GC content: 59%
IMG OID: 641705887
Product: malate synthase G
Protein accession: YP_001833011
Protein GI: 182678865
COG category: [C] Energy production and conversion
COG ID: [COG2225] Malate synthase
TIGRFAM ID: [TIGR01345] malate synthase G
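
The start, end, gene length and protein length fields above are arithmetically related, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (which matches the listed 2184 bp) and that the CDS ends in a single stop codon that is not counted as a residue. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above; function names are illustrative only.
START_BP = 2163535
END_BP = 2165718


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2184 727
```

Both results agree with the Gene Length and Protein Length fields of this record.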


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.305447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTATC TTTCGATCAA CGGCCTTTCG GTCGATTCGG AACTTCACGA TTTTATCGTC 
AAGGAAGCTA TTCCCGGCTC TGGCGTCAGC GCCGAACAAT TTTGGACTGG CCTTGCCGAT
CTCGTCGCCG CGAAAAGCGC GACGAACCGC GCCTTGCTCG CCAAGCGCGA TGAATTGCAG
GCCAAGATCG ATGCTTATTA TCGCTCCGGC GGCTCAAAGG ATCCGGCTGC GACGGAAGCC
TTCCTGCGCG AGATCGGCTA TCTTCTGCCC GAGCCCGCGC CCCAGGCGAT CGAGACGCAA
AATGTCGACG ATGAAATCGC CACGATCGCC GGGCCACAGC TCGTCGTGCC GGGCTCCAAT
GCCCGCTATG CGCTGAATGC CGCCAATGCC CGTTGGGGCA GCCTCTATGA CGCGCTCTAT
GGCACGGATG CGATTCCGGA AACCAATGGC GCGGAGCGCG GCAAGGGTTT CAACCCCACG
CGTGGCGCGC TCGTCGTCGC CAAGGCCAAG ACCATTCTCG ACCAGTCCGT GCCGCTTGAC
GGCGCGTCCT ATGCGGATGT CACGGCCTAT CAGATCGAGG GGAGCACCCT GAAGGCCACG
ACGAAGGATG GCAAGAGCAT CGGCCTGAAG ACGCCCGAGC AATTCGCTGG TTTCACCGGC
ACCAAGGAAG CGCCAGCCAC CATCCTTTTG CGTCATAACC ATCTCCATTT CGAGCTCGTC
GTCAATGCCA ACCATCCGGT CGGCAAGACG GATCCGGCCG GCCTCGCCGA TGTCATCATC
GAGTCCGCCG TGTCGACCAT CATCGACATG GAAGACAGTG TTGCCGTGGT CGATGCCGCC
GATAAAGTGG CGCTTTATCG CAATATTCTC GGCCTGATGC AGGGCACGCT TTCGGACACA
TTCGTCAAGA ATGGCGCTTC GCTGACCCGC ACCCTGCATG ACGATCGGAT CTACCAGACT
GGCGATGGCG GCACCTTGCG CCTGCATGGG CGCAGCCTGC TGCTCATCCG CAATGTCGGC
CATCATATGT TCACCGACGC GGTCCTTGAT GCGTCGGGTG CCGAAATCCC GGAAAATATG
CTCGACGGCA TGGTGACAGC GCTTGTCGCG CGCCATGATC TTTATGGCAC CCATGAACAC
CATAACAGCC GCAAGGGTTC GGTCTATATC GTCAAGCCCA AAATGCATGG GCCACAGGAA
GTGGCTTTTG CCAATTCCGT GTTCGAACAT ATCGAACTGG CGATCGGTCT GCCCGCGAAT
ACGCTGAAGA TCGGCATCAT GGACGAGGAA CGGCGCACGA CCGTCAATCT GATGGCGGGG
ATCCAAGCGG CCTCCAAGCG TATTGTCTTC ATCAATACGG GCTTCCTCGA TCGCACCGGC
GACGAAATCC ATACGGCGAT GGAAGCCGGC CCCATGGTGC GCAAGAACAG CATCAAGAAC
GAAGCCTGGC TGAAAGCCTA TGAAGACAAT AATGTGGATG TCGGCCTGGC GAGTGGCCTG
CCCGGCCATG CTCAGATCGG CAAGGGCATG TGGGCCGCTC CCGATGCCAT GGCCGATATG
CTGACTCAGA AGATCGGTCA TCCGCGCGCC GGCGCCAATA CGGCCTGGGT GCCCTCGCCC
ACCGCCGCGA CCTTGCATGC GCTTCACTAT CATGAAGTGG ATGTCTTCGA GCGGCAGAAG
GAACTGGCTG GACGGCCGCG TGCCAAACTG CATGATCTTC TAACCGTGCC GGTCGCCAAA
ACGACCTATT CTGCGCAGGA GATTCAGGAA GAGCTCGACA ATAACGCCCA GGGTCTGCTC
GGCTATGTCG TGCGCTGGAT CGACCAGGGC GTCGGCTGTT CGAAAGTGCC GGACATTCAC
AATGTCGCCT TGATGGAAGA TCGCGCGACC TTGCGGATTT CCAGCCAGCA TATTGCCAAC
TGGCTATATC ACAAGATCGT CACCGAGAGC CAAGTCCTGG AAACCCTGAA GCGCATGGCG
GTCGTCGTCG ATCAGCAGAA TGCCGGCGAT TCCGCCTACC GGCCGATGGC ACCCGCCTTC
GACGGCATCG CTTTCAAAGC GGCCTGCGAT CTCGTCTTTA AGGGACGTGA ACAGCCAAAC
GGCTATACGG AATTCCTGCT GCATTCACGC CGTCGCGAGG TCAAGGCCGA ACAGGCCCAA
AGTGGCCAAG CCTCTGCGGC TTAA
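
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to strip the layout and compute a GC percentage comparable to the GC content field. How the database rounds its reported value is not stated in the record, so the helper returns an unrounded figure. The function names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listed above, used only as sample input.
first_line = "ATGGGTTATC TTTCGATCAA CGGCCTTTCG GTCGATTCGG AACTTCACGA TTTTATCGTC"
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value to compare against the GC content field of the record.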
 
Protein sequence
MGYLSINGLS VDSELHDFIV KEAIPGSGVS AEQFWTGLAD LVAAKSATNR ALLAKRDELQ 
AKIDAYYRSG GSKDPAATEA FLREIGYLLP EPAPQAIETQ NVDDEIATIA GPQLVVPGSN
ARYALNAANA RWGSLYDALY GTDAIPETNG AERGKGFNPT RGALVVAKAK TILDQSVPLD
GASYADVTAY QIEGSTLKAT TKDGKSIGLK TPEQFAGFTG TKEAPATILL RHNHLHFELV
VNANHPVGKT DPAGLADVII ESAVSTIIDM EDSVAVVDAA DKVALYRNIL GLMQGTLSDT
FVKNGASLTR TLHDDRIYQT GDGGTLRLHG RSLLLIRNVG HHMFTDAVLD ASGAEIPENM
LDGMVTALVA RHDLYGTHEH HNSRKGSVYI VKPKMHGPQE VAFANSVFEH IELAIGLPAN
TLKIGIMDEE RRTTVNLMAG IQAASKRIVF INTGFLDRTG DEIHTAMEAG PMVRKNSIKN
EAWLKAYEDN NVDVGLASGL PGHAQIGKGM WAAPDAMADM LTQKIGHPRA GANTAWVPSP
TAATLHALHY HEVDVFERQK ELAGRPRAKL HDLLTVPVAK TTYSAQEIQE ELDNNAQGLL
GYVVRWIDQG VGCSKVPDIH NVALMEDRAT LRISSQHIAN WLYHKIVTES QVLETLKRMA
VVVDQQNAGD SAYRPMAPAF DGIAFKAACD LVFKGREQPN GYTEFLLHSR RREVKAEQAQ
SGQASAA