Gene Bind_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3801 
Symbol 
ID6198019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010580 
Strand
Start bp110490 
End bp112127 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content59% 
IMG OID641703934 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_001831086 
Protein GI182676939 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.382943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCG ATCAGAATGT TTCTGCCGAT GTCGTGATCG TCGGCTCCGG TGTCGCGGGC 
AGCTCCATTG CTAATGAATT GGCGCGGGCC GGAATTTCCG TCATCGTTCT TGAAGCTGGC
CCCCGCGTGG ACCGTCAGCA TTTCGTTGAG AACTTTCGTA ACCTTGAGAA CAAGCCGTCC
TATCAGGGGC CGTTCCCGTC CACACCCTGG GCTCCGCATC CGCCGAACCA GATGACGCCC
AACCAATACC TGCATACGAC GGGTCCAAAT GCCGAGGCCT ATCAGCAGGT CTATCTTCGA
ATGATCGGCG GTACGACGTG GCACTGGGCC GGATGTGCCT GGCGCTTTCT CCCCTCCGAT
TTCGAACTCA AGACCCGTTA CGGGCAGGGG CGCGACTGGG CGCTGAAATA CGATGACCTC
GAGCCGTTCT ATTACCAGGC TGAGGTTATG ATGGGTGTCT GCGGACCGGA CCCTAAGATT
GAGGATCTTG GCTCTCCGCG TAAGCAGCCC TACCCCATGT CGGCGCTGCC CATTTCCTAC
GCCGCGCAGC AGTTCCGCAA GCTCATCAGC AAGCAGACGC CATGGCGCAT CGTGCATGAG
CCACAGGCCC GCAATACGCA GCCCTATGAC GGGCGTCCCA CCTGCGAAGG CCATAACAAC
TGCATGCCGA TCTGCCCGAT CGGAGCCATG TATAACGGCA GCTATTCCGT CTATCACGCA
GAGGCCGCCG GGGCGACGTT CATCCCCAAT GCTGTCGCCT ACAGGATCGA GCGTGATGCC
GCCAACAGGA AAGTGACGGC GGTTCACTAT TACGATCCGG ATAAAGGGTC GCACCGCGTC
GCGGGAAAGT ATTTCGTCAT TGCCGCGCAC TGCATCGAGA CGGCGAAGCT GCTTCTCGTC
TCGGCGGATG ACAAGAGCCC AGACGGTGTT GCGAATAGCT CGAGCCATGT CGGCCGGAAC
ATGATGGACC ATACCGGGGT GCAAGTCACG TTCATCAGTG GCGATAAAGC GCTCTGGCCC
GGTCGTGGCC CGCTTTTGAC GAACGTGATC GACACCTTTC GCGACGGCGA TTGGCGTGGG
GAGCACGGCG CCTATCTGGT GCATATGGTG GACGATAACC AAGTGGACCT CGCGACGCAG
CTCGCGATCT CCAAGGGGTA TGTCGGACAC GATCTGGAAG AACAGATCCG CTATCTGGCC
TCCCATACCG TTCGTCTGTT CAGCCATAAC GAGGCCTTGC CGGATCCCGA CAACCGCCTG
ACCCTCAGCA AGACGCACAA GGACGCGCTC GGTATCCCGC ATCCGGAAGT CTATTATAAG
CTGCCAGACT ATACGGTGCG AAGCTGCGAG CATACGCGTG GTGTGTTCAG GCAACTCATC
GGTCTTATGC ACGGAACCGA TGAGCAATGG ACGCCGGGAT ATTTCCCGCA GGACCATCCC
TCTGGAAGTA CCATCATGGG CGCGGACCCC AGGGATTCCG TGGTGGATGG CCATTGCCGG
ACGCACGACC ATGAGAATCT GTTCATCGCA AGCTCGTCTG TCTTCTCAAC GGTCGGGACG
GGCAACATCA CCCTGACAGT AGCCGCCCTC GCGCTTCGTG TTGCTGATAT GCTGAAAAGA
GAACTACGCC ATGCCTGA
 
Protein sequence
MSSDQNVSAD VVIVGSGVAG SSIANELARA GISVIVLEAG PRVDRQHFVE NFRNLENKPS 
YQGPFPSTPW APHPPNQMTP NQYLHTTGPN AEAYQQVYLR MIGGTTWHWA GCAWRFLPSD
FELKTRYGQG RDWALKYDDL EPFYYQAEVM MGVCGPDPKI EDLGSPRKQP YPMSALPISY
AAQQFRKLIS KQTPWRIVHE PQARNTQPYD GRPTCEGHNN CMPICPIGAM YNGSYSVYHA
EAAGATFIPN AVAYRIERDA ANRKVTAVHY YDPDKGSHRV AGKYFVIAAH CIETAKLLLV
SADDKSPDGV ANSSSHVGRN MMDHTGVQVT FISGDKALWP GRGPLLTNVI DTFRDGDWRG
EHGAYLVHMV DDNQVDLATQ LAISKGYVGH DLEEQIRYLA SHTVRLFSHN EALPDPDNRL
TLSKTHKDAL GIPHPEVYYK LPDYTVRSCE HTRGVFRQLI GLMHGTDEQW TPGYFPQDHP
SGSTIMGADP RDSVVDGHCR THDHENLFIA SSSVFSTVGT GNITLTVAAL ALRVADMLKR
ELRHA